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STRUCTURE-BASED DRUG DESIGN FOR ULP1 PROTEASE SUBSTRATES 

This application claims priority under 35 U.S.C. §119 from provisional patent 
application Serial No. 60/205,336, filed May 18, 2000; which is hereby incorporated by reference 
in its entirety. 



FIELD OF THE INVENTION 

5* The present invention relates to rational drug design of cysteine protease inhibitors 

1 0 based on the crystal structure of a Ulp 1 protease trapped with its substrate in a covalent intermediate. 



n BACKGROUND OF THE INVENTION 



f - Ubiqui tin (Ub) and ubiqui tin-like (Ubl) proteins modulate protein function in the cell 

y through covalent modification of the target protein. Two proteins that have been identified as 
P 15 belonging to this family are ubiquitin and Small Ubiquitin-related Modifier (SUMO- 1). The Ub/Ubl 
conjugated state of cellular proteins has been associated with several critical pathways, including 
cellular differation, apoptosis, the cell-cycle, and cellular responses to stress (Hershko and 
Ciehanover, 1998, Annu. Rev. Biochem., 67:425; Hochstrasser, 1998, Genes and Develop., 12:901; 
Laney and Hochstrasser, 1999; Cell, 97:427, Saitoh, et ah, 1997, TIBS, 22:374; Johnson and 
20 Hochstrasser, 1 997, Trends. Cell. Biol., 7:408). Alterations in the regulation of the Ub/Ubl pathway 
has been implicated in several disease states, including tumorogenesis and acute promyelocyte 
leukemia (Hershko and Ciehanover, 1998, Annu. Rev. Biochem., 67:425; Kamitani et aL, 1998, J. 
Biol. Chem., 273:26675). 

Ub and Ubl proteins are covalently attached to their cellular targets via a reversible 
25 amide linkage between a lysine 8-amino group on the target and the C-terminus of the Ub/Ubl 
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protein. The fate of the modified protein depends on if it is conjugated to ubiquitin or SUMO-1. 
Ubiquinated and poly-ubiquinated proteins are generally targeted to the 26S proteosome for 
degradation in cell-cycle dependent and cell-cycle independent pathways. Sumoylated protein 
targets appear to modulate the activity of the targeted system either directly or indirectly by altering 
5 cellular localization. Cellular pathways that appear to be modulated by the SUMO pathway include 
activation of RanGAPl, p53 transcriptional regulation, and licba protection from ubiquitination 
(Matunis,^a/., 1996; J. Cell. Biol., 135:1457, Gostissa,*/ a/., 1999,EMBOJ., 18:6462; Rodriguez, 
et ah, 1999, EMBO J., 18:6455, Desterro, et al, 1998; Mol. Cell., 2:233). Ub/Ubl modifiers are 
generally deconjugated from their cellular targets by a class of cysteine proteases, known as 
1 0 deubiquinating (DUB) enzymes. 

Studies in yeast show that Smt3, a yeast member of the Ubl protein family, 
conjugation and deconjugation to proteins appears critical to several yeast functions, including septin 
ring formation, chromosomal segregation, and progression of the cell cycle. Smt3 shares 47% 
sequence identity with mammalian SUMO-1 and 17% sequence identity with ubiquitin. Recently, 
II 1 5 studies showed the identification of a novel Saccharomyces cerevisiae gene product termed Ubl- 
specific protease 1 (Ulp 1 ). In contrast to the ubiquitin DUB system, Ulp 1 catalyzes Smt3 processing 
and Smt3-protein deconjugation. 

There is a continuing need in the art to develop methods that enable one to evaluate 
interactions between proteases and their substrates. The development of an efficient method of 
2 20 trapping the protease and substrate in a complex, which would allow for the study of these 
interactions is needed. Such a method could allow for the development of compounds that could 
specifically interact with the amino acids necessary for protease function. 

U.S. Patent No. 5,834,228 discloses the use of rational drug design for developing 
inhibitors of the apopain that will form non-covalent interactions with active site, as based on the 
25 crystal structure of the protease and substrate. The reference, however, does not indicate how to trap 
the protease and substrate in a reaction intermediate state or use this method for identifying 
compounds that produce covalent interactions with the protease. Thus, there is a need in the art to 
define active site structures of cystine proteases complexed with their substrates. 
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SUMMARY OF THE INVENTION 

The present invention provides for a composition that is comprised of a polypeptide 
that comprises the catalytic domain of a SUMO protease and its substrate. In the composition, the 
SUMO protease is trapped in a deacylation intermediate complex with the substrate. 

The invention further provides for a method of forming a complex between a 
polypeptide that comprises the catalytic domain of a protease with its substrate by (1) combining the 
protease and substrate in a molar ratio; (2) adding a reducing agent, which is capable of trapping a 
proteolytic deacylation intermediate complex of the protease and substrate, in an amount that is 
effective to trap the protease and substrate and (3) adjusting the pH of the mixture to about 7.0. 

The present invention provides for polynucleotide sequence that encodes a mutant 
Ulpl . The mutant Ulpl may contain an amino acid substitution at position 432, 448, 451 472, 474, 
489, 490, 493, or 5 1 5. The invention also provides for vectors containing polynucleotide sequences 
of the mutant Ulp 1 , cells with the vectors containing the mutant Ulp 1 polynucleotide sequences, and 
the mutant Ulpl polypeptides. 

The present invention also advantageously provides a method of identifying potential 
substrates of cystine proteases by rational drug design. The method is comprised of designing 
candidate substrates that form interactions with catalytic amino acids which are identified from 
computer modeling studies based on the crystal structure of the complex of the protease and 
substrate. 

BRIEF DESCRIPTION OF DRAWINGS 
Figure 1A and IB. Stereo images of the Ulpl active site in complex with Smt3. 
Hydrogen bonds are represented as black spheres. Cys580, His541, and Asp351 represent the 
catalytic triad in the protease fold. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention advantageously provides an isolated, preferably purified, 
trapped protease-substrate complex, preferably a cysteine protease. In particular, protease- 
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substrate complexes of the invention are those of the sumo pathway, involved in many important 
cellular processes. 

The invention is based, in part, on trapping and crystallization of a Ulpl-Smt3 
covalent complex, from which the Ulpl crystal structure was obtained (coordinates are attached 
5 as Table 1 (after the specification and before the claims) and deposited in the Protein Data Bank 
with accession no. NC-001 148). This crystal structure permits the rational drug design of Ulpl 
and Ulpl ortholog inhibitions, which function to promote ubiquitin and ubiquitin-like protein 
activities in cells. 



10 General Definitions 

As used herein, the term "isolated" means that the referenced material is removed 
Q from the environment in which it is normally found. Thus, an isolated biological material can be 

fij free of cellular components, /. e. , components of the cells in which the material is found or produced. 

in 

In the case of nucleic acid molecules, an isolated nucleic acid includes a PCR product, an isolated 

* 

W 15 mRNA, a cDNA, or a restriction fragment. In another embodiment, an isolated nucleic acid is 
S| preferably excised from the chromosome in which it may be found, and more preferably is no longer 
f% joined to non-regulatory, non-coding regions, or to other genes, located upstream or downstream of 
the gene contained by the isolated nucleic acid molecule when found in the chromosome. In yet 
%| another embodiment, the isolated nucleic acid lacks one or more introns. Isolated nucleic acid 
H 20 molecules include sequences inserted into plasmids, cosmids, artificial chromosomes, and the like. 
Thus, in a specific embodiment, a recombinant nucleic acid is an isolated nucleic acid. An isolated 
protein may be associated with other proteins or nucleic acids, or both, with which it associates in 
the cell, or with cellular membranes if it is a membrane-associated protein. An isolated organelle, 
cell, or tissue is removed from the anatomical site in which it is found in an organism. An isolated 
25 material may be, but need not be, purified. 

The term "purified" as used herein refers to material that has been isolated under 
conditions that reduce or eliminate the presence of unrelated materials, i.e. , contaminants, including 
native materials from which the material is obtained. For exarhple, a purified protein is preferably 
substantially free of other proteins or nucleic acids with which it is associated in a cell; a purified 



nucleic acid molecule is preferably substantially free of proteins or other unrelated nucleic acid 
molecules with which it can be found within a cell. As used herein, the term "substantially free" is 
used operationally, in the context of analytical testing of the material. Preferably, purified material 
substantially free of contaminants is at least 50% pure; more preferably, at least 90% pure, and more 
5 preferably still at least 99% pure. Purity can be evaluated by chromatography, gel electrophoresis, 
immunoassay, composition analysis, biological assay, and other methods known in the art. 

Methods for purification are well-known in the art. For example, nucleic acids can 
be purified by precipitation, chromatography (including preparative solid phase chromatography, 
oligonucleotide hybridization, and triple helix chromatography), ultracentrifugation, and other 
10 means. Polypeptides and proteins can be purified by various methods including, without limitation, 
preparative disc-gel electrophoresis, isoelectric focusing, HPLC, reversed-phase HPLC, gel 
El filtration, ion exchange and partition chromatography, precipitation and salting-out chromatography, 

yj 

gj extraction, and countercurrent distribution. For some purposes, it is preferable to produce the 

%l polypeptide in a recombinant system in which the protein contains an additional sequence tag that 
^ 15 facilitates purification, such as, but not limited to, a polyhistidine sequence, or a sequence that 

S| specifically binds to an antibody, such as FLAG and GST. The polypeptide can then be purified 

f% from a crude lysate of the host cell by chromatography on an appropriate solid-phase matrix. 

'f 3 Alternatively, antibodies produced against the protein or against peptides derived therefrom can be 

%ssje 

SI used as purification reagents. Cells can be purified by various techniques, including centrifugation, 

yi 20 matrix separation (e.g., nylon wool separation), panning and other immunoselection techniques, 
depletion (e.g., complement depletion of contaminating cells), and cell sorting (e.g., fluorescence 
activated cell sorting [FACS]). Other purification methods are possible. A purified material may 
contain less than about 50%, preferably less than about 75%, and most preferably less than about 
90%, of the cellular components with which it was originally associated. The "substantially pure" 
25 indicates the highest degree of purity which can be achieved using conventional purification 
techniques known in the art. In a specific embodiment, the term "about" or "approximately" means 
within 20%, preferably within 10%, and more preferably within 5% of a given value or range. 

A "sample" as used herein refers to a biological material which can be tested for the 
presence of Ulpl protein or Ulpl nucleic acids. Such samples can be obtained from animal subjects, 
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such as humans and non-human animals, and include tissue, biopsies, blood and blood products; 
plural effusions; cerebrospinal fluid (CSF); ascites fluid; and cell culture. 

The term "about" or "approximately" means within an acceptable error range for a 
given measurement. In one aspect, the error range can be within 25% of a value, preferably within 
5 1 0%, and more preferably within 5%. Alternatively, particularly in biological systems, an acceptable 
error is one order of magnitude, preferably 2-fold. 

The use of italics indicates a nucleic acid molecule (e.g., Ulpl cDNA, gene, etc.); 
normal text indicates the polypeptide or protein. 



10 Cloning and Expression of mutant Ulpl 

The present invention contemplates generation of a gene encoding wild-type or a 
Q mutant Ulp 1 , including a full length and any antigenic fragments thereof from any source, preferably 
gj human. It further contemplates detection of mutant Ulpl protein for evaluation, diagnosis, or 
i| therapy. 

15 In accordance with the present invention there may be employed conventional 

( SJ molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e.g. , Sambrook, Fritsch & Maniatis, Molecular 
Cloning: A Laboratory Manual Second Edition ( 1 989) Cold Spring Harbor Laboratory Press, Cold 
H Spring Harbor, New York (herein "Sambrook et al , 1 989"); DNA Cloning: A Practical Approach, 
U 20 Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (MJ. Gait ed. 1984); Nucleic 
Acid Hybridization [B.D. Hames & S.J. Higginseds. (1985)]; Transcription And Translation [B.D. 
Hames & S.J. Higgins, eds. (1984)]; Animal Cell Culture [R.I. Freshney, ed. (1986)]; Immobilized 
Cells And Enzymes [IRL Press, (1986)]; B.Perbal,^ Practical Guide To Molecular Cloning (1984); 
F.M. Ausubel etal (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994). 
25 Molecular Biology - Definitions 

"Amplification" of DNA as used herein denotes the use of polymerase chain reaction 
(PCR) to increase the concentration of a particular DNA sequence within a mixture of DNA 
sequences. For a description of PCR see Saiki et ai 9 Science, 239:487, 1988. 



The polynucleotides encoding Ulpl herein may be flanked by natural regulatory 
(expression control) sequences, or may be associated with heterologous sequences, including 
promoters, internal ribosome entry sites (IRES) and other ribosome binding site sequences, 
enhancers, response elements, suppressors, signal sequences, polyadenylation sequences, introns, 
5'- and 3'- non-coding regions, and the like. The nucleic acids may also be modified by many means 
known in the art. Non-limiting examples of such modifications include methylation, "caps", 
substitution of one or more of the naturally occurring nucleotides with an analog, and internucleotide 
modifications such as, for example, those with uncharged linkages (e.g., methyl phosphonates, 
phosphotriesters, phosphoroamidates, carbamates, etc.) and with charged linkages (e.g., 
phosphorothioates, phosphorodithioates, etc.). Polynucleotides may contain one or more additional 
covalently linked moieties, such as, for example, proteins (e.g., nucleases, toxins, antibodies, signal 
peptides, poly-L-lysine, etc.), intercalators (e.g., acridine, psoralen, etc.), chelators (e.g., metals, 
radioactive metals, iron, oxidative metals, etc.), and alkylators. The polynucleotides may be 
derivatized by formation of a methyl or ethyl phosphotriester or an alkyl phosphoramidate linkage. 
Furthermore, the polynucleotides herein may also be modified with a label capable of providing a 
detectable signal, either directly or indirectly. Exemplary labels include radioisotopes, fluorescent 
molecules, biotin, and the like. 

The term M host cell" means any cell of any organism that is selected, modified, 
transformed, grown, or used or manipulated in any way, for the production of a substance by the cell, 
for example the expression by the cell of a gene, a DNA or RNA sequence, a protein or an enzyme. 
Host cells can further be used for screening or other assays, as described infra. 

Proteins and enzymes are made in the host cell using instructions in DNA and RNA, 
according to the genetic code. Generally, a DNA sequence having instructions for a particular 
protein or enzyme is "transcribed" into a corresponding sequence of RNA. The RNA sequence in 
turn is "translated" into the sequence of amino acids which form the protein or enzyme. An "amino 
acid sequence" is any chain of two or more amino acids. Each amino acid is represented in DNA 
or RNA by one or more triplets of nucleotides. Each triplet forms a codon, corresponding to an 
amino acid. For example, the amino acid lysine (Lys) can be coded by the nucleotide triplet or 
codon AAA or by the codon AAG. (The genetic code has some redundancy, also called degeneracy, 



meaning that most amino acids have more than one corresponding codon.) Because the nucleotides 
in DNA and RNA sequences are read in groups of three for protein production, it is important to 
begin reading the sequence at the correct amino acid, so that the correct triplets are read. The way 
that a nucleotide sequence is grouped into codons is called the "reading frame." 
5 A "coding sequence" or a sequence "encoding" an expression product, such as a 

RNA, polypeptide, protein, or enzyme, is a nucleotide sequence that, when expressed, results in the 
production of that RNA, polypeptide, protein, or enzyme, i.e., the nucleotide sequence encodes an 
amino acid sequence for that polypeptide, protein or enzyme. A coding sequence for a protein may 
include a start codon (usually ATG) and a stop codon. 
10 A "promoter sequence" is a DNA regulatory region capable of binding RNA 

polymerase in a cell and initiating transcription of a downstream (3' direction) coding sequence. For 
9 purposes of defining the present invention, the promoter sequence is bounded at its 3' terminus by 
g(J the transcription initiation site and extends upstream (5 1 direction) to include the minimum number 

m 

^ of bases or elements necessary to initiate transcription at levels detectable above background. 
5 II 15 Within the promoter sequence will be found a transcription initiation site (conveniently defined for 
SJ example, by mapping with nuclease S 1), as well as protein binding domains (consensus sequences) 
ri responsible for the binding of RNA polymerase. 

A coding sequence is "under the control" or "operatively associated with" of 
2*1 transcriptional and translational control sequences in a cell when RNA polymerase transcribes the 
!^ 20 coding sequence into mRNA, which is then trans-RNA spliced (if it contains introns) and translated 
into the protein encoded by the coding sequence. 

The terms "express" and "expression" mean allowing or causing the information in 
a gene or DNA sequence to become manifest, for example producing a protein by activating the 
cellular functions involved in transcription and translation of a corresponding gene or DNA 
25 sequence. A DNA sequence is expressed in or by a cell to form an "expression product" such as a 
protein. The expression product itself , e.g. the resulting protein, may also be said to be "expressed" 
by the cell. An expression product can be characterized as intracellular, extracellular or secreted. 
The term "intracellular" means something that is inside a cell. The term "extracellular" means 



something that is outside a cell. A substance is "secreted" by a cell if it appears in significant 
measure outside the cell, from somewhere on or inside the cell. 

The term "transfection" means the introduction of a foreign nucleic acid into a cell. 
The term "transformation" means the introduction of a "foreign" (i.e. extrinsic or extracellular) gene, 
DNA or RNA sequence to a host cell, so that the host cell will express the introduced gene or 
sequence to produce a desired substance, typically a protein or enzyme coded by the introduced gene 
or sequence. The introduced gene or sequence may also be called a "cloned" or "foreign" gene or 
sequence, may include regulatory or control sequences, such as start, stop, promoter, signal, 
secretion, or other sequences used by a cell's genetic machinery. The gene or sequence may include 
nonfunctional sequences or sequences with no known function. A host cell that receives and 
expresses introduced DNA or RNA has been "transformed" and is a "transformant" or a "clone." 
The DNA or RNA introduced to a host cell can come from any source, including cells of the same 
genus or species as the host cell, or cells of a different genus or species. 

The terms "vector", "cloning vector" and "expression vector" mean the vehicle by 
which a DNA or RNA sequence (e.g. a foreign gene) can be introduced into a host cell, so as to 
transform the host and promote expression (e.g. transcription and translation) of the introduced 
sequence. Vectors include plasmids, phages, viruses, etc.; they are discussed in greater detail below. 

Vectors typically comprise the DNA of a transmissible agent, into which foreign 
DNA is inserted. A common way to insert one segment of DNA into another segment of DNA 
involves the use of enzymes called restriction enzymes that cleave DNA at specific sites (specific 
groups of nucleotides) called restriction sites. A "cassette" refers to a DNA coding sequence or 
segment of DNA that codes for an expression product that can be inserted into a vector at defined 
restriction sites. The cassette restriction sites are designed to ensure insertion of the cassette in the 
proper reading frame. Generally, foreign DNA is inserted at one or more restriction sites of the 
vector DNA, and then is carried by the vector into a host cell along with the transmissible vector 
DNA. A segment or sequence of DNA having inserted or added DNA, such as an expression vector, 
can also be called a "DNA construct." A common type of vector is a "plasmid", which generally 
is a self-contained molecule of double-stranded DNA, usually of bacterial origin, that can readily 
accept additional (foreign) DNA and which can readily introduced into a suitable host cell. A 
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plasmid vector often contains coding DNA and promoter DN A and has one or more restriction sites 
suitable for inserting foreign DNA. Coding DNA is a DNA sequence that encodes a particular 
amino acid sequence for a particular protein or enzyme. Promoter DNA is a DNA sequence which 
initiates, regulates, or otherwise mediates or controls the expression of the coding DNA. Promoter 
5 DNA and coding DNA may be from the same gene or from different genes, and may be from the 
same or different organisms. A large number of vectors, including plasmid and fungal vectors, have 
been described for replication and/or expression in a variety of eukaryotic and prokaryotic hosts. 
Non-limiting examples include pKK plasmids (Clonetech), pUC plasmids, pET plasmids (Novagen, 
Inc., Madison, WI), pRSET or pREP plasmids (Invitrogen, San Diego, CA), or pMAL plasmids 
1 0 (New England Biolabs, Beverly, MA), and many appropriate host cells, using methods disclosed or 
cited herein or otherwise known to those skilled in the relevant art. Recombinant cloning vectors 
will often include one or more replication systems for cloning or expression, one or more markers 
for selection in the host, e.g. antibiotic resistance, and one or more expression cassettes. 
%l The term "expression system" means a host cell and compatible vector under suitable 

1 5 conditions, e.g. for the expression of a protein coded for by foreign DNA carried by the vector and 
introduced to the host cell. Common expression systems include E. coli host cells and plasmid 
vectors, insect host cells and Baculovirus vectors, and mammalian host cells and vectors. 

The term "heterologous" refers to a combination of elements not naturally occurring. 
For example, heterologous DNA refers to DNA not naturally located in the cell, or in a chromosomal 
20 site of the cell. Preferably, the heterologous DNA includes a gene foreign to the cell. A 
heterologous expression regulatory element is a such an element operatively associated with a 
different gene than the one it is operatively associated with in nature. In the context of the present 
invention, a Ulpl gene is heterologous to the vector DNA in which it is inserted for cloning or 
expression, and it is heterologous to a host cell containing such a vector, in which it is expressed, 
25 e.g., a yeast cell. 

The terms "mutant" and "mutation" mean any detectable change in genetic material, 
e.g. DNA, or any process, mechanism, or result of such a change. This includes gene mutations, in 
which the structure {e.g. DNA sequence) of a gene is altered, any gene or DNA arising from any 
mutation process, and any expression product {e.g. protein or enzyme) expressed by a modified gene 
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or DN A sequence. The term "variant" may also be used to indicate a modified or altered gene, DNA 
sequence, enzyme, cell, etc., i.e., any kind of mutant. 

"Sequence-conservative variants" of a polynucleotide sequence are those in which 
a change of one or more nucleotides in a given codon position results in no alteration in the amino 
5 acid encoded at that position. 

"Function-conservative variants" are those in which a given amino acid residue in a 
protein or enzyme has been changed without altering the overall conformation and function of the 
polypeptide, including, but not limited to, replacement of an amino acid with one having similar 
properties (such as, for example, polarity, hydrogen bonding potential, acidic, basic, hydrophobic, 
1 0 aromatic, and the like). Amino acids with similar properties are well known in the art. For example, 
arginine, histidine and lysine are hydrophilic-basic amino acids and may be interchangeable. 
Q Similarly, isoleucine, a hydrophobic amino acid, may be replaced with leucine, methionine or valine, 
ffi Such changes are expected to have little or no effect on the apparent molecular weight or isoelectric 
%^ point of the protein or polypeptide. Amino acids other than those indicated as conserved may differ 
HI 15 in a protein or enzyme so that the percent protein or amino acid sequence similarity between any two 
Sj proteins of similar function may vary and may be, for example, from 70% to 99% as determined 
%^ according to an alignment scheme such as by the Cluster Method, wherein similarity is based on the 
W MEG ALIGN algorithm. A "function-conservative variant" also includes a polypeptide or enzyme 

Sj which has at least 60 % amino acid identity as determined by BLAST or FASTA algorithms, 

dj 

f~ 20 preferably at least 75%, most preferably at least 85%, and even more preferably at least 90%, and 

spa 

which has the same or substantially similar properties or functions as the native or parent protein or 
enzyme to which it is compared. 

As used herein, the term "homologous" in all its grammatical forms and spelling 
variations refers to the relationship between proteins that possess a "common evolutionary origin," 
25 including proteins from superfamilies (e.g., the immunoglobulin superfamily) and homologous 
proteins from different species (e.g., myosin light chain, etc.) (Reeck et ai, 1987, Cell 50:667). 
Such proteins (and their encoding genes) have sequence homology, as reflected by their sequence 
similarity, whether in terms of percent similarity or the presence of specific residues or motifs at 
conserved positions. 
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Accordingly, the term "sequence similarity" in all its grammatical forms refers to the 
degree of identity or correspondence between nucleic acid or amino acid sequences of proteins that 
may or may not share a common evolutionary origin (see Reeck et al , supra). However, in common 
usage and in the instant application, the term "homologous," when modified with an adverb such as 
"highly," may refer to sequence similarity and may or may not relate to a common evolutionary 
origin. 

In a specific embodiment, two DNA sequences are "substantially homologous" or 
"substantially similar" when at least about 80%, and most preferably at least about 90 or 95%) of the 
nucleotides match over the defined length of the DNA sequences, as determined by sequence 
comparison algorithms, such as BLAST, F ASTA, DNA Strider, etc. An example of such a sequence 
is an allelic or species variant of the specific Ulpl genes of the invention. Sequences that are 
substantially homologous can be identified by comparing the sequences using standard software 
available in sequence data banks, or in a Southern hybridization experiment under, for example, 
stringent conditions as defined for that particular system. 

Similarly, in a particular embodiment, two amino acid sequences are "substantially 
homologous" or "substantially similar" when greater than 80% of the amino acids are identical, or 
greater than about 90% are similar (functionally identical). Preferably, the similar or homologous 
sequences are identified by alignment using, for example, the GCG (Genetics Computer Group, 
Program Manual for the GCG Package, Version 7, Madison, Wisconsin) pileup program, or any of 
the programs described above (BLAST, FASTA, etc) 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such as 
a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule can 
anneal to the other nucleic acid molecule under the appropriate conditions of temperature and 
solution ionic strength (see Sambrook et aL, supra). The conditions of temperature and ionic 
strength determine the "stringency" of the hybridization. For preliminary screening for homologous 
nucleic acids, low stringency hybridization conditions, corresponding to a T m (melting temperature) 
of 55 °C, can be used, e.g. , 5x SSC, 0. 1 % SDS, 0.25% milk, and no formamide; or 30% formamide, 
5x SSC, 0.5% SDS). Moderate stringency hybridization conditions correspond to a higher T m , e.g., 
40% formamide, with 5x or 6x SCC. High stringency hybridization conditions correspond to the 



highest T m , e.g., 50% formamide, 5x or 6x SCC. SCC is a 0.15M NaCl, 0.0 15M Na-citrate. 
Hybridization requires that the two nucleic acids contain complementary sequences, although 
depending on the stringency of the hybridization, mismatches between bases are possible. The 
appropriate stringency for hybridizing nucleic acids depends on the length of the nucleic acids and 
5 the degree of complementation, variables well known in the art. The greater the degree of similarity 
or homology between two nucleotide sequences, the greater the value of T m for hybrids of nucleic 
acids having those sequences. The relative stability (corresponding to higher T m ) of nucleic acid 
hybridizations decreases in the following order: RNA.RNA, DNA:RNA, DNA:DNA. For hybrids 
of greater than 100 nucleotides in length, equations for calculating T m have been derived (see 
10 Sambrook et al, supra, 9.50-9.51). For hybridization with shorter nucleic acids, i.e., 
oligonucleotides, the position of mismatches becomes more important, and the length of the 

O oligonucleotide determines its specificity (see Sambrook et al., supra, 11.7-11.8). A minimum 

-.ft 

■ssjr 

yj length for a hybridizable nucleic acid is at least about 10 nucleotides; preferably at least about 15 
%l nucleotides; and more preferably the length is at least about 20 nucleotides. 
^ 15 In a specific embodiment, the term "standard hybridization conditions" refers to a T m 

SI of 55 °C, and utilizes conditions as set forth above. In a preferred embodiment, the T m is 60 °C; in 
a more preferred embodiment, the T m is 65 °C. In a specific embodiment, "high stringency" refers 

~? 

~; to hybridization and/or washing conditions at 68 °C in 0.2XSSC, at 42 °C in 50% formamide, 

M 4XSSC, or under conditions that afford levels of hybridization equivalent to those observed under 

£~[ 20 either of these two conditions. 

As used herein, the term "oligonucleotide" refers to a nucleic acid, generally of at 
least 10, preferably at least 15, and more preferably at least 20 nucleotides, preferably no more than 
1 00 nucleotides, that is hybridizable to a genomic DNA molecule, a cDNA molecule, or an mRNA 
molecule encoding a gene, mRNA, cDNA, or other nucleic acid of interest. Oligonucleotides can 
25 be labeled, e.g., with 32 P-nucleotides or nucleotides to which a label, such as biotin, has been 
covalently conjugated. In one embodiment, a labeled oligonucleotide can be used as a probe to 
detect the presence of a nucleic acid. In another embodiment, oligonucleotides (one or both of which 
may be labeled) can be used as PCR primers, either for cloning full length or a fragment of Ulpl, 
or to detect the presence of nucleic acids encoding Ulpl. In a further embodiment, an 
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oligonucleotide of the invention can form a triple helix with a UIpl DNA molecule. Generally, 
oligonucleotides are prepared synthetically, preferably on a nucleic acid synthesizer. Accordingly, 
oligonucleotides can be prepared with non-naturally occurring phosphoester analog bonds, such as 
thioester bonds, etc. 

5 

Ulpl Nucleic Acids 

A gene encoding mutant Ulp 1 , whether genomic DNA or cDN A, can be isolated from 
any source, particularly from a human cDNA or genomic library. Methods for obtaining Ulpl gene 
are well known in the art, as described above (see, e.g., Sambrook et al., 1989, supra). The DNA 
1 0 may be obtained by standard procedures known in the art from cloned DNA (e.g. , a DNA "library"), 
and preferably is obtained from a cDN A library prepared from tissues with high level expression of 
the protein, by chemical synthesis, by cDNA cloning, or by the cloning of genomic DNA, or 
fragments thereof, purified from the desired cell (See, for example, Sambrook et ai, 1989, supra\ 
Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, MRL Press, Ltd., Oxford, U.K. Vol. 

vxs. s 

- r y 15 I, II). Clones derived from genomic DNA may contain regulatory and intron DNA regions in 
SI addition to coding regions; clones derived from cDNA will not contain intron sequences. Whatever 

X 

p the source, the gene should be molecularly cloned into a suitable vector for propagation of the gene. 

Identification of the specific DNA fragment containing the desired Ulpl gene may be accomplished 
%j in a number of ways. For example, a portion of an Ulpl gene exemplified infra can be purified and 
P 20 labeled to prepare a labeled probe, and the generated DNA may be screened by nucleic acid 
hybridization to the labeled probe (Benton and Davis, Science 196:180, 1977; Grunstein and 
Hogness, Proc. Natl. Acad. Sci. U.S.A. 72:3961, 1975). Those DNA fragments with substantial 
homology to the probe, such as an allelic variant from another individual, will hybridize. 

Further selection can be carried out on the basis of the properties of the gene, e.g., if 
25 the gene encodes a protein product having the isoelectric, electrophoretic, amino acid composition, 
partial or complete amino acid sequence, antibody binding activity, or ligand binding profile of Ulpl 
protein as disclosed herein. Thus, the presence of the gene may be detected by assays based on the 
physical, chemical, immunological, or functional properties of its expressed product. 
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Other DNA sequences which encode substantially the same amino acid sequence as 
an Ulpl gene may be used in the practice of the present invention. These include but are not limited 
to allelic variants, species variants, sequence conservative variants, and functional variants. 

Amino acid substitutions may also be introduced to substitute an amino acid with a 
5 particularly preferable property. For example, a Cys may be introduced a potential site for disulfide 
bridges with another Cys. 

The genes encoding Ulpl derivatives and analogs of the invention can be produced 
by various methods known in the art. The manipulations which result in their production can occur 
at the gene or protein level. For example, the cloned Ulpl gene sequence can be modified by any 
10 of numerous strategies known in the art (Sambrook et aL, 1989, supra). The sequence can be 
cleaved at appropriate sites with restriction endonuclease(s), followed by further enzymatic 
modification if desired, isolated, and ligated in vitro. In the production of the gene encoding a 
derivative or analog of Ulpl, care should be taken to ensure that the modified gene remains within 
the same translational reading frame as the Ulpl gene, uninterrupted by translational stop signals, 
15 in the gene region where the desired activity is encoded. 

Additionally, the Ulpl encoding nucleic acid sequence can be mutated in vitro or in 



i vivo, to create and/or destroy translation, initiation, and/or termination sequences, or to create 

N> variations in coding regions and/or form new restriction endonuclease sites or destroy preexisting 

n ones, to facilitate further in vitro modification. Such modifications can be made to introduce 

20 restriction sites and facilitate cloning the Ulpl gene into an expression vector. Any technique for 
mutagenesis known in the art can be used, including but not limited to, in vitro site-directed 
mutagenesis (Hutchinson, C, et a/., J. Biol. Chem. 253:6551, 1978; Zoller and Smith, DNA 3:479- 
488, 1984; Oliphant et aL, Gene 44:177, 1986; Hutchinson et aL, Proc. Natl. Acad. Sci, U.S.A. 
83:710, 1986), use of TAB" linkers (Pharmacia), etc. PCR techniques are preferred for site directed 
25 mutagenesis (see Higuchi, 1989, "Using PCR to Engineer DNA", in PCR Technology: Principles 
and Applications for DNA Amplification, H. Erlich, ed., Stockton Press, Chapter 6, pp. 61-70). 

The identified and isolated gene can then be inserted into an appropriate cloning 
vector. A large number of vector-host systems known in the art may be used. Possible vectors 
include, but are not limited to, plasmids or modified viruses, but the vector system must be 
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compatible with the host cell used. Examples of vectors include, but are not limited to, E. coli, 
bacteriophages such as lambda derivatives, or plasmids such as pBR322 derivatives or pUC plasmid 
derivatives, e.g., pGEX vectors, pmal-c, pFLAG, etc. The insertion into a cloning vector can, for 
example, be accomplished by ligating the DNA fragment into a cloning vector which has 
complementary cohesive termini. However, if the complementary restriction sites used to fragment 
the DNA are not present in the cloning vector, the ends of the DNA molecules may be enzymatically 
modified. Alternatively, any site desired may be produced by ligating nucleotide sequences (linkers) 
onto the DNA termini; these ligated linkers may comprise specific chemically synthesized 
oligonucleotides encoding restriction endonuclease recognition sequences. 

Recombinant molecules can be introduced into host cells via transformation, 
transfection, infection, electroporation, etc., so that many copies of the gene sequence are generated. 
Preferably, the cloned gene is contained on a shuttle vector plasmid, which provides for expansion 
in a cloning cell, e.g., E. coli, and facile purification for subsequent insertion into an appropriate 
expression cell line, if such is desired. For example, a shuttle vector, which is a vector that can 
replicate in more than one type of organism, can be prepared for replication in both E. coli and 
Saccharomyces cerevisiae by linking sequences from an E. coli plasmid with sequences form the 
yeast 2/u plasmid. 

Expression of Ulpl Polypeptides 

The nucleotide sequence coding for Ulp 1 , or antigenic fragment, derivative or analog 
thereof, or a functionally active derivative, including a chimeric protein, thereof, can be inserted into 
an appropriate expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted protein-coding sequence. Thus, a nucleic acid encoding 
Ulpl of the invention can be operationally associated with a promoter in an expression vector of the 
invention. Both cDNA and genomic sequences can be cloned and expressed under control of such 
regulatory sequences. Such vectors can be used to express functional or functionally inactivated 
Ulpl polypeptides. 
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The necessary transcriptional and translational signals can be provided on a 
recombinant expression vector, or they may be supplied by the native gene encoding Ulpl and/or 
its flanking regions. 

Potential host- vector systems include but are not limited to mammalian cell systems 
transfected with expression plasmids or infected with virus (e.g. , vaccinia virus, adenovirus, adeno- 
associated virus, herpes virus, etc.); insect cell systems infected with virus (e.g., baculovirus); 
microorganisms such as yeast containing yeast vectors; or bacteria transformed with bacteriophage, 
DNA, plasmid DNA, or cosmid DNA. The expression elements of vectors vary in their strengths 
and specificities. Depending on the host-vector system utilized, any one of a number of suitable 
transcription and translation elements may be used. 

Expression of Ulpl protein may be controlled by any promoter/enhancer element 
known in the art, but these regulatory elements must be functional in the host selected for expression. 
Promoters which may be used to control Ulpl gene expression include, but are not limited to, 
cytomegalovirus (CMV) promoter, the SV40 early promoter region (Benoist and Chambon, 1981, 
Nature 290:304-310), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto, et al, Cell 22:787-797, 1980), the herpes thymidine kinase promoter (Wagner et al, 
Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445, 1981), the regulatory sequences of the metallothionein 
gene (Brinster et al, Nature 296:39-42, 1982); prokaryotic expression vectors such as the p- 
lactamase promoter (Villa-Komaroff, et aL, Proc. Natl. Acad. Sci. U.S.A. 75:3727-3731, 1978), or 
the tac promoter (DeBoer, et ai, Proc. Natl. Acad. Sci. U.S.A. 80:21-25, 1983); see also "Useful 
proteins from recombinant bacteria" in Scientific American, 242:74-94, 1980; promoter elements 
from yeast or other fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, 
PGK (phosphoglycerol kinase) promoter, alkaline phosphatase promoter; and transcriptional control 
regions that exhibit tissue specificity, particularly endothelial cell-specific promoters. 

Soluble forms of the protein can be obtained by collecting culture fluid, or 
solubilizing inclusion bodies, e.g., by treatment with detergent, and if desired sonication or other 
mechanical processes, as described above. The solubilized or soluble protein can be isolated using 
various techniques, such as polyacrylamide gel electrophoresis (PAGE), isoelectric focusing, 
2-dimensional gel electrophoresis, chromatography (e.g., ion exchange, affinity, immunoaffinity, 
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and sizing column chromatography), centrifugation, differential solubility, immunoprecipitation, or 
by any other standard technique for the purification of proteins. 

Vectors 

A wide variety of host/expression vector combinations may be employed in 
expressing the DNA sequences of this invention. Useful expression vectors, for example, may 
consist of segments of chromosomal, non-chromosomal and synthetic DNA sequences. Suitable 
vectors include derivatives of SV40 and known bacterial plasmids, e.g., E. coli plasmids col El, 
pCRl, pBR322, pMal-C2, pET, pGEX (Smith et al, Gene 67:31-40, 1988), pMB9 and their 
derivatives, plasmids such as RP4; phage DNAS, e.g., the numerous derivatives of phage 1, e.g., 
NM989, and other phage DNA, e.g., Ml 3 and filamentous single stranded phage DNA; yeast 
plasmids such as the 2pt plasmid or derivatives thereof; vectors useful in eukaryotic cells, such as 
vectors useful in insect or mammalian cells; vectors derived from combinations of plasmids and 
phage DNAs, such as plasmids that have been modified to employ phage DNA or other expression 
control sequences; and the like. 

Preferred vectors for introducing Ulpl coding sequences into mammalian or insect 
cells are viral vectors, such as lentiviruses, retroviruses, herpes viruses, adenoviruses, adeno- 
associated viruses, vaccinia virus, baculovirus, and other recombinant viruses with desirable cellular 
tropism. Thus, a gene encoding a functional or mutant Ulp 1 protein or polypeptide domain fragment 
thereof can be introduced using a viral vector or through direct introduction of DNA. 

Viral vectors commonly used for targeting procedures are DNA -based vectors and 
retroviral vectors. Methods for constructing and using viral vectors are known in the art {see, e.g., 
Miller and Rosman, BioTechniques, 7:980-990, 1992). Preferably, the viral vectors are replication 
defective, that is, they are unable to replicate autonomously in the target cell. In general, the genome 
of the replication defective viral vectors which are used within the scope of the present invention 
lack at least one region which is necessary for the replication of the virus in the infected cell. These 
regions can either be eliminated (in whole or in part), be rendered non-functional by any technique 
known to a person skilled in the art. These techniques include the total removal, substitution (by 
other sequences, in particular by the inserted nucleic acid), partial deletion or addition of one or more 
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bases to an essential (for replication) region. Such techniques may be performed in vitro (on the 
isolated DNA) or in situ, using the techniques of genetic manipulation or by treatment with 
mutagenic agents. Preferably, the replication defective virus retains the sequences of its genome 
which are necessary for encapsidating the viral particles. 

DNA viral vectors include an attenuated or defective DNA virus, such as but not 
limited to herpes simplex virus (HSV), papillomavirus, Epstein Barr virus (EBV), adenovirus, 
adeno-associated virus (AAV), and the like. Defective viruses, which entirely or almost entirely lack 
viral genes, are preferred. Defective virus is not infective after introduction into a cell. Use of 
defective viral vectors allows for administration to cells in a specific, localized area, without concern 
that the vector can infect other cells. Thus, a specific tissue can be specifically targeted. Examples 
of particular vectors include, but are not limited to, a defective herpes virus 1 (HSV 1) vector (Kaplitt 
et al , Molec . Cell . Neurosci . 2 : 3 20-33 0,1991), defective herpes virus vector lacking a glyco-protein 
L gene (Patent Publication RD 371005 A), or other defective herpes virus vectors (International 
Patent Publication No. WO 94/21807, published September 29, 1994; International Patent 
Publication No. WO 92/05263, published April 2, 1994); an attenuated adenovirus vector, such as 
the vector described by Stratford-Perricaudet et al (J. Clin. Invest. 90:626-630, 1992; see also La 
Salle et al., Science 259:988-990, 1993); and a defective adeno-associated virus vector (Samulski 
etalj. Virol. 61:3096-3101, 1987; Samulski etal.,3. Virol. 63:3822-3828, 1989; Lebkowski etal, 
Mol. Cell. Biol. 8:3988-3996, 1988). 

Various companies produce viral vectors commercially, including but by no means 
limited to Avigen, Inc. (Alameda, CA; AAV vectors), Cell Genesys (Foster City, CA; retroviral, 
adenoviral, AAV vectors, and lentiviral vectors), Clontech (retroviral and baculoviral vectors), 
Genovo, Inc. (Sharon Hill, PA; adenoviral and AAV vectors), Genvec (adenoviral vectors), 
IntroGene (Leiden, Netherlands; adenoviral vectors), Molecular Medicine (retroviral, adenoviral, 
AAV, and herpes viral vectors), Norgen (adenoviral vectors), Oxford BioMedica (Oxford, United 
Kingdom; lentiviral vectors), and Transgene (Strasbourg, France; adenoviral, vaccinia, retroviral, 
and lentiviral vectors). 

In another embodiment, the vector can be introduced by lipofection, as naked DNA, 
or with other transfection facilitating agents (peptides, polymers, etc.). Synthetic cationic lipids can 
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be used to prepare liposomes for in vivo transfection of a gene encoding a marker (Feigner, et. al. 9 
Proc. Natl. Acad. Sci. U.S.A. 84:7413-7417, 1987; Feigner and Ringold, Science 337:387-388, 
1989; see Mackey, etal, Proc. Natl. Acad. Sci. U.S.A. 85:8027-8031, 1988; Ulmer, et ai, Science 
259:1745-1748, 1993). Useful lipid compounds and compositions for transfer of nucleic acids are 
5 described in International Patent Publications W095/18863 and W096/17823, and in U.S. Patent 
No. 5,459,127. Lipids may be chemically coupled to other molecules for the purpose of targeting 
(see Mackey, et al, supra). Targeted peptides, e.g., hormones or neurotransmitters, and proteins 
such as antibodies, or non-peptide molecules could be coupled to liposomes chemically. 

Other molecules are also useful for facilitating transfection of a nucleic acid, such as 
10 a cationic oligopeptide (e.g. , International Patent Publication W095/2 1931), peptides derived from 
DNA binding proteins (e.g., International Patent Publication WO96/25508), or a cationic polymer 
(e.g., International Patent Publication W095/21931). 

Alternatively, non- viral DNA vectors can be introduced into the desired host cells by 
methods known in the art, e.g. , electroporation, microinjection, cell fusion, DEAE dextran, calcium 
15 phosphate precipitation, use of a gene gun (ballistic transfection; see, e.g., U.S. Pat. No. 5,204,253, 
y U.S. Pat. No. 5,853,663, U.S. Pat No. 5,885,795, and U.S. Pat. No. 5,702,384 and see Sanford, 
O TIB-TECH, 6:299-302, 1988; Fynan etal, Proc. Natl. Acad. Sci. U.S.A., 90:1 1478-1 1482, 1993; 

U and Yang et aL, Proc. Natl. Acad. Sci. U.S.A., 87:1568-9572, 1990), or use of a DNA vector 

transporter (see, e.g., Wu, et al 9 J. Biol. Chem. 267:963-967, 1992; Wu and Wu, J. Biol. Chem. 
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20 263:14621-14624, 1988; Hartmut, et ai, Canadian Patent Application No. 2,012,31 1, filed March 
15, 1990; Williams, etal, Proc. Natl. Acad. Sci. USA 88:2726-2730, 1991). Receptor-mediated 
DNA delivery approaches can also be used (Curiel, et al. 9 Hum. Gene Ther. 3:147-154, 1992; Wu 
and Wu, J. Biol. Chem. 262:4429-4432, 1987). 
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Protease-Substrate Complex Formation 

The present invention describes a composition comprising a polypeptide of a SUMO 
protease catalytic domain in a trapped proteolytic deacylation intermediate complex with its 
substrate. The composition may then be treated with methods known in the art to produce a 
5 crystalline structure of the complex. The present invention contemplates characterization of the 
structure of any protease that produces deacylation intermediates upon interaction with a substrate, 
particularly any cysteine protease, e.g., by trapping the protease-substrate covalent intermediate. 
Specifically, the present invention contemplates a composition where the SUMO protease is Ulpl . 
Ulpl may be derived from various sources, including, but not limited to, mammal, yeast, insects, and 
10 plants. In one preferred embodiment, the Ulpl is derived from mammal, specifically human. The 
composition may include any substrate of Ulpl, such as SUMO and Smt3. In a specific 
7& embodiment, the substrate is Smt3. 

The complexed molecules in the composition are preferably obtained in a crystalline 
structure. The complexed molecules may be formed into a crystalline structure using techniques that 
1 5 are well known in the art, where the crystallizing conditions are modified and optimized for crystal 
formation of specific proteases and their substrate. Alternatively, other spectroscopic techniques for 
structure evaluation, such as nuclear magnetic resonance, Raman spectroscopy, circular dichroism, 
and neutron diffraction can be used to study trapped protease-substrate complexes. 

The coordinates of atoms and structure of the protease and substrate in the crystal 



l ?5 



^* 20 structure may be defined using any method and computational method needed. For example, if the 
protease and substrate complex are in a liquid form, then nuclear magnetic resonance can be used 
to determine the coordinates and structure of the complex. For complexes that are in a crystalline 
structure x-ray crystallography may be used to elucidate the structure. 

The invention also describes a method of preparatively trapping and purifying 
25 large amounts of the proteolytic deacylation intermediate using a reducing agent. The method of 
trapping the proteolytic deacylation intermediate involves (a) combining the protease and 
substrate in a molar ratio, (b) adding a reducing agent that is capable of trapping the substrate in 
an intermediate state with the protease, and (c) then adjusting the pH of the solution to about 7.0. 
The invention allows for performing steps (a) and (b) in any order or simultaneously. In a 
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specific embodiment the protease is a SUMO protease, and more specifically the SUMO protease 
is the catalytic domain of Ulpl. Ulpl may be derived from various sources, including, but not 
limited to, mammal, yeast, insects, and plants. In one preferred embodiment, the Ulpl is derived 
from mammal, specifically human. The complex includes any substrate of Ulpl, such as SUMO 
5 and Smt3. In a specific embodiment, the substrate is Smt3. 

The pH of the solution may be adjusted to about 7.0 by various methods known in 
the art, such as adding the required amount of an acid or base to the solution to adjust the pH to 
about 7.0. Other methods that may be used to adjust the pH of a solution, as appropriate for the 
system, include, but not limited to, chromatography and dialysis. In one embodiment of the 
10 invention, the pH of the solution is adjusted by dialysis. The protease-substrate complex may 
then be isolated from the solution after the pH is adjusted by dialysis. The protease-substrate 
% complex may be isolated from the solution using any methods known in the art, including high 
fjf performance liquid chromatography, size exclusion chromatography, dialysis, fast protein liquid 
51 chromatography, and crystallization. Other method of isolating the complex may be used, 

ft! 

jj 1 5 depending on the protease and substrate. 

^ The molar ratio of protease to substrate may encompass any ratio where the 

3 concentration of substrate is in excess to insure that the protease may form a complex with it. 

fH 

=i Theoretically, the substrate may be in infinite excess to the protease, however that would lead to 

=| significant waste of substrate that remains uncomplexed to the protease. In the present invention the 

^ 20 molar ratio of the protease to substrate ranges from 1 : 1 to 1 :5. More preferred, the Ulpl and Smt3 
are in a 1 :3 molar ratio. 

The protease and substrate are trapped in a deacylation complex by addition of a 
reducing agent in an amount that is effective to trap the proteins. An effective amount of reducing 
agent is defined as the amount needed to trap an isolatable amount of the complex in the solution. 
25 Any reducing agent is contemplated by the present invention. However, several reducing agents, 
such as lithium aluminum hydride, could reduce other portions of the protease and substrate in the 
complex. In one specific embodiment the reducing agent is sodium borohydride. 



Screening and Chemistry 
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According to the present invention, the structure of a Ulpl-Smt3 complex, or other 
trapped protease-substrate complexes, are useful to identify drugs that are effective in treating 
disorders associated with Ub/Ubl pathway, including tumorogenesis and acute promyelocytic 
leukemia. 

Rational Drug Design 

The invention also defines a method of identifying novel substrates for cysteine 
proteases by using rational drug design methods. The invention allows for designing substrates that 
would interact with amino acids in the protease catalytic site as defined by computer molecular 
modeling methods based on the crystal structure of the protease and other substrates. A substrate 
may include a competitive inhibitor, analog of the native substrate, or a suicide inhibitor. The 
designed substrates would be designed so that they interact with amino acid residues that are defined 
to be (i) necessary for interaction with a substrate and (ii) protease activity. 

The present invention contemplates evaluating potential compounds for covalent and 
non-covalent interactions between the protease and substrate. Computer modeling methods that may 
be used to evaluate these interactions include, but are not limited to, SYBYL and Monte Carlo 
computer programs. The present invention contemplates computer algorithms that evaluate bonded 
and non-bonded interactions between the protease and the substrate. Bonded interactions that may 
be evaluated include, but are not limited to, bond stretching, rotational strain, and torsional strain. 
Non-bonded interactions that may be evaluated include van Der Waals forces, hydrogen bonds and 
dipole-dipole interaction. 

In particular, identification of putative Ulp binding compounds provides for 
development of screening assays, particularly for high throughput screening of molecules that up- 
or down-regulate the activity of Ulpl, e.g., by permitting expression of Ulpl in quantities greater 
than can be isolated from natural sources, or in indicator cells that are specially engineered to 
indicate the activity of Ulpl expressed after transfection or transformation of the cells. 

Any screening technique known in the art can be used to screen for Ulpl agonists or 
antagonists. The present invention contemplates screens for small molecule ligands or ligand 
analogs and mimics, as well as screens for natural ligands that bind to and agonize or antagonize 
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Ulpl expression activity in vivo. For example, natural products libraries can be screened using 
assays of the invention for molecules that agonize or antagonize Ulpl expression or activity. 

Another approach uses recombinant bacteriophage to produce large libraries. Using 
the "phage method" (Scott and Smith, Science 249:386-390, 1990; Cwirla, et al , Proc. Natl Acad. 
5 Sci., 87:6378-6382, 1990; Devlin et al, Science, 49:404-406, 1990), very large libraries can be 
constructed (10 6 -10 8 chemical entities). A second approach uses primarily chemical methods, of 
which the Geysen method (Geysen et al , Molecular Immunology 23:709-715, 1986; Geysen et al 
J. Immunologic Method 102:259-274, 1987; and the method of YoAoxetal (Science 251:767-773, 
1 99 1 ) are examples. Furka et al ( 1 4th International Congress of Biochemistry , Volume #5, Abstract 
10 FR:013, 1988; Furka, Int. J. Peptide Protein Res. 37:487-493, 1991), Houghton (U.S. Patent No. 
4,631,21 1, issued December 1986) and Rutter et al (U.S. Patent No. 5,010,175, issued April 23, 
1991) describe methods to produce a mixture of peptides that can be tested as agonists or 
antagonists. 

Hi 

01 In another aspect, synthetic libraries (Needels et al, Proc, Natl. Acad. Sci. USA 

15 90:10700-4, 1993; Ohlmeyer etal, Proc. Natl. Acad. Sci. USA 90:10922-10926, 1993; Lam etal, 
International Patent Publication No. WO 92/00252; Kocis etal , International Patent Publication No. 
WO 9428028) and the like can be used to screen for Ulpl ligands according to the present invention. 

Test compounds are screened from large libraries of synthetic or natural compounds. 
Numerous means are currently used for random and directed synthesis of saccharide, peptide, and 
20 nucleic acid based compounds. Synthetic compound libraries are commercially available from 
Maybridge Chemical Co. (Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates 
(Merrimack, NH), and Microsource (New Milford, CT). A rare chemical library is available from 
Aldrich (Milwaukee, WI). Alternatively, libraries of natural compounds in the form of bacterial, 
fungal, plant and animal extracts are available from e.g. Pan Laboratories (Bothell, WA) or 
25 MycoSearch (NC), or are readily producible. Additionally, natural and synthetically produced 
libraries and compounds are readily modified through conventional chemical, physical, and 
biochemical means (Blondelle et al, Tib Tech, 14:60, 1996). 

Knowledge of the crystal structure of Ulpl, particularly when trapped with a 
substrate, can provide an initial clue as the inhibitors or antagonists of the protein. Moreover, based 
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on sequence and enzymatic activity similarities of Ulpl to other cysteine proteases, particularly cell- 
cycle associated cysteine proteases, the crystal structure of Ulpl permits development of classes of 
cysteine protease inhibitors. Identification and screening of antagonists is further facilitated by 
determining structural features of the protein, e.g. , using X-ray crystallography, neutron diffraction, 
5 nuclear magnetic resonance spectrometry, and other techniques for structure determination. These 
techniques provide for the rational design or identification of agonists and antagonists. 



In vivo screening methods 

Intact cells or whole animals expressing a gene encoding Ulpl can be used in 
10 screening methods to identify candidate drugs. 

In one series of embodiments, a permanent cell line is established. Alternatively, 
J" cells (including without limitation mammalian, insect, yeast, or bacterial cells) are transiently 

S programmed to express an Ulpl gene by introduction of appropriate DNA or mRN A. Identification 

CP of candidate compounds can be achieved using any suitable assay, including without limitation (i) 

RJ 

j£ 1 5 assays that measure selective binding of test compounds to Ulpl (ii) assays that measure the ability 

^ of a test compound to modify (/. e. , inhibit or enhance) a measurable activity or function of Ulp 1 and 

0 (iii) assays that measure the ability of a compound to modify (i.e., inhibit or enhance) the 

: U transcriptional activity of sequences derived from the promoter (i.e., regulatory) regions the Ulpl 
gene. 
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High-Throughput Screen 

Agents according to the invention may be identified by screening in high-throughput 
assays, including without limitation cell-based or cell-free assays. It will be appreciated by those 
skilled in the art that different types of assays can be used to detect different types of agents. Several 
25 methods of automated assays have been developed in recent years so as to permit screening of tens 
of thousands of compounds in a short period of time. Such high-throughput screening methods are 
particularly preferred. The use of high-throughput screening assays to test for agents is greatly 
facilitated by the availability of large amounts of purified polypeptides, as provided by the invention. 
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EXAMPLES 

The present invention will be better understood by reference to the following 
Examples, which are provided as exemplary of the invention, and not by way of limitation. 

5 Materials and Methods 

Protein purification 

A Ulpl amino acid fragment, from amino acids 403-621 (Ulp 1(403-621)) , was 
amplified by polymerase chain reaction. The PCR fragment was digested with Nhel and Xhol 
restriction endonucleases and then ligated into a pET-28b vector (Novagen). E. coli BL21 
10 (DE3)pLysS bacterial cells were transformed with Ulp 1(403-621). Cells were grown at 37° C for 
20-24 hours. Cell cultures then were grown at 37° C in superbroth, containing 50 mg/ml kanamycin 
y and 25 mg/ml chloamphenicol, by fermentation. When the A 600 of the culture reached about 2.0 the 

01 culture was cooled to 30° C, adjusted to 1 .0 mM IPTG, and fermented for about 4 hours at 30° C. 

if! 

m Remaining steps were performed at 4° C. The culture was centrifuged and the cell pellet was 

5 1 5 resuspended in 200 mL lysis buffer (50 mM Tris-HCl, 0. 1 5 mM NaCl, 20% (w/v) sucrose; pH = 8.0) 

% i and sonicated. Insoluble material was removed by centrifugation and the soluble fraction was placed 

£ 

C! over 1 0 mL of Ni-NTA-agarose resin (Qiagen) and washed with buffer A (50 mM Tris-HCl, 0.5 M 

in 

£7 NaCl, 20 mM imidazole). The protein was eluted from the resin with a gradient (20-300 mM 
^ imidazole) in buffer A. Peak fractions of His-tagged Ulp 1(403 -62 l)p were pooled and dialyzed 
fS 20 against 50 mM Tris-HCl (pH=8.0) containing 200 mM NaCl, 2 mM p-mercaptoethanol, and 100 
units bovine thrombin (Sigma) for about 20-24 hours. 

ASmt3 PCRfragment(Smt3(13-101)p) was digested withNcol andXhol restriction 
endonucleases and then ligated into a pET-28b vector. E. coli BL21 (DE3)pLysS bacterial cells 
were transformed with Smt3(13-101)p. Expression and purification of the protein was performed 
25 following the same protocol as described for Ulpl (403-62 1 )p, with the exception that the protein was 
dialyzed against 50 mM Tris-HCl (pH=8.0) containing 200 mM NaCl. 

Sumo (??-101)p was prepared in a similar manner to that described for Smt3(13- 

101)p. 
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Selenomethionyl-containing Ulp 1 (403-62 1 ) was expressed in DL4 1 cells (methionine 
auxotrophe) that contained the bacteriophage T7 polymerase (Hendrickson et al, EMBO J., 1990, 
9:1665-1672). 

The Smt3-GFPuv fusion protein was constructed by ligating SMT3 into the pGFPuv 
5 vector (Clontech). Smt3-GFPuv was overexpressed in JM109 E. coli strain. Purification was 
attained by standard chromatographic techniques and included passes over size exclusion, anion- 
exchange resins, and finally, application to a MonoQ column (Pharmacia). The Smt3-GFP moiety 
was followed during purification by analyzing fluorescence over a standard UV transilluminator. 

His6-ubiquitin-Smt3-HA was expressed from pQE30 (Li and Hochstrasser) in JM1 09 
10 E. coli. The histidine-tagged fusion protein (His6-ubiquitin-Smt3 -HA) was placed over Ni-agarose 
resin, washed and eluted as previously described. The pooled fractions were then placed onto a gel 
"% filtration column (Superdex 75 prep-grade) for size exclusion. These fractions were pooled and 
Sdj placed onto a MonoQ column (Pharmacia) and eluted with a 0.02- 1 M gradient of NaCl. Smt3-GFP, 
yl Sumo-His, and His6-ubiquitin-Smt3-HA were concentrated to -10 mg/ml and frozen at -80° C. 
l t 15 

^ Proteolysis assays 

O Proteolysis assays were prepared by incubating a solution at about 37° C that 

HI 

[7 contained about 1 .0 mg/ml substrate and about 1 0~ 3 -10* 6 mg/ml Ulp 1(403 -62 l)p in buffer containing 
^ 20mM Tris-HCl (pH = 8.0) and 1 50mM NaCl. The Smt3-GFP reaction was subsequently analyzed 
M 20 by native-gel electrophoresis. The reactions containing Sumo-His and His6-ubiquitin-Smt3-HA 
were analyzed by SDS-PAGE. 



Synthesis of covalent adduct between UlpU403-621)p and Smt3(13-101)p 

Ulp 1 (403 -62 1 )p and Smt3 ( 1 3 - 1 0 1 )p were placed in a beaker at a 1 : 3 molar ratio with 
25 stirring. Then 5 aliquots of sodium borohydride were stirred into the beaker over 30 minutes to a 
final concentration of 50 mM. The mixture was dialyzed against 20 mM Tris-HCl (pH=8.0) 
containing 20 mM NaCl at 4° C. The dialyzed solution was then loaded onto a Mono-Q column 
(Qiagen). The Ulpl (403-62 1 )p -Smt3( 1 3-1 01 )p complex was eluted from the column with 0.02-0.5 
MNaCl gradient. Fractions were analyzed by native gel electrophoresis. Ulp 1 (403-62 l)p -Smt3(13- 
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101)p was purified further by size exclusion chromatography on a Superdex75 prep-grade column 
(Pharmacia). Fractions were pooled, concentrated to about 3.5 mg/mL, and stored in 20 mM Tris- 
HC1 containing 50 mM NaCl at -80° C. 

5 Crystallization and data collection 

Crystals were grown at 21° C by the hanging drop vapor diffusion method, 
Ulp 1(403 -62 l)p -Smt3(13-101)p complex was mixed with an equal volume of reservoir buffer 
containing 0.1 M MES (pH=6.5), 10% (w/v) polyethylene glycol 20000, and 3% (w/v) 1,6- 
hexandiol. Crystals were grown over several days to a size of 0.05x0.1x0.3 mm. Crystals were 
10 stabilized in reservoir solution for heavy atoms soaks and pre-incubated in reservoir solution plus 
17% ethylene glycol. X-ray data used to collect native and heavy atom sets were collected using 
O crystals at a laboratory copper Ka source (Rigaku RU200) equipped with a confocal optics and a 
gj Raxis-IV imaging plate detector system. The high-resolution native data set and a four- wavelength 
%l MAD data set were collected at the National Synchrotron Light Source (Brookhaven, NY) at 
m 15 beamline X4A on an ADSC Quantum-4 detector. Data were processed using DENZO and 
SJ SC ALEP ACK. Ulp 1 (403-62 1 )p -Smt3 ( 1 3 - 1 0 1 )p were crystallized in space group P2 1 2 1 2 (a= 1 2 5 . 8 
L A, b-53.4 A, c=54.3 A, a=p=y=90°). 

M. 

Sf Structure determination and refinement 

Q 20 2.6 A electron density maps for the Ulp 1 (403-62 l)p -Smt3(13-101)p complex were 

initially derived using multiple isomorphous replacement (MIR) methods and solvent flattening 
(DM). Mercury sites were identified using Patterson methods and placed into MLPHARE for phase 
refinement. An initial trace into these maps was accomplished using the program O and the model 
was not refined. A selenomethionyl-derived Ulpl (403-625)p and native Smt3(l l-97)p crystal was 
25 subjected to a four-wavelength MAD experiment (Hendrickson, et ai, EMBO J., 1991, 9:1665- 
1672). The 1 .8 A MAD data was input into the program SOLVE (Terwilliger and Berendzen, Acta 
Crystallogr, 1999,055:849-861). Six selenium sites were located and used to generate 1.8 A phases. 
Solvent flattening was performed using the program DM (Cowtan, Joint CCP4 and ESF-EACBM 
Newsletter on Protein Crystallography, 1 994, 3 1 :34-38). The initial model derived from the MIR 
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electron density was placed into the MAD electron density maps for manual building. Refinement 
was accomplished using the programs REFMAC (Murshudov et al, Acta. Crystallogr., 1997, 
A50:157-163) and wARP/ARP (Lamzin and Wilson, Acta. Crystallogr., 1993, D49:129-149) for 
automated building and placement of waters. The model underwent three rounds of refinement at 
5 1.8 A with manual rebuilding between each round. Refinement was extended to 1 .6 A for another 
round. The current model contains 735 waters and 303 amino acids (Rfree=25.3, R=17.9) and has 
excellent geometry; 92.9% of residues are in the most favored region of the Ramachandran plot with 
none occurring in disallowed regions. The model contained all 219 amino acid residues of 
Ulp 1(403 -62 l)p plus two additional N-terminal amino acids that correspond to residues left behind 
10 after thrombin cleavage of the N-terminally HIS-tagged protein. Ulplp Arg422 did not have 
adequate density to model its side chain. The model for Smt3(13-98)p contains 79 of the possible 
87 amino acid residues. Electron density for N-terminal Smt3 residues 12 through 19 could not be 

5| observed and were left out of the model. 

HI 

~;l 15 Yeast plasmids and strains 

"~~J Standard techniques were used to grow and maintain yeast strains (Guthrie and Fink, 

Q Methods Enzymol., 1991; 194). Yeast strain W303-1A (MATa ura3-l ade2-l trpl-1 his3, -1115, 

111 

r; leu2-3, -112 canl-100) was used to generate a Ulpl null strain by one-step gene replacement 
jjj procedure (Rothstein, Methods Enzymol., 1991; 194:281-301). The resulting strain was termed 
p 20 Ulpl A strain W303ULP1 (MATa ura3-l ade2-l trpl-1 his3, -1115, leu2-3, -112 canl-100, 
Ulp l:\LEU2, pSE360-ULPl). In vivo complementation analysis was conducted using plasmid 
shaffle technique (Boeke et al, Methods Enzymol., 1987; 154,164). W303ULP1 was transformed 
with pSE3 5 8 plasmid containing either wildtype or mutant Ulp 1 under Ulp 1 natural promoter. Yeast 
cells were streaked onto agar plates that lack tryptophan and incubated at about 30° C until yeast 
25 colonies were visible. Colonies were then restreaked onto agar plates containing and 5-fluoro-orotic 
acid to induce the loss of pSE360-ULPl . 



Glacatose inducible expression 



10 
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Wildtype or deletion mutant Ulpl fragments were amplified by PCR. The fragments 
were digested with EcoRl and BamHl restriction endonucleases and ligated into PYX133 (CEN 
TRP1 GAL promoter; Ingenius). W303-1 A was transformed with the plasmid. Yeast strains were 
grown on plates lacking tryptophan, to select for cells that contain the plasmid of interest, at about 
30° C. Positive yeast colonies were then streaked onto SD(-Trp) agar plates, containing 2% (w/v) 
raffinose, with and without 0.01% (w/v) galactose. Cells were incubated at 30° C until a color 
change in the yeast colonies was observed. 

Point mutations were introduced to subcloned fragments using QuikChange Site- 
Directed Mutagenesis Kit (Qiagen), 



Results 

Ulpl (403-62 l)p displayed full catalytic activity in cleavage reactions with human 
SUMO-1 and yeast Smt3 to produce their respective mature protein forms (data not shown). 
Ulpl (403-62 1 )p also showed full catalytic activity in deconjugation reactions with Smt3-GFPuv and 
^ 15 His6-ubiquitin-Smt3-HA (data not shown), suggesting that the C-terminal fragment of Ulpl 
(Ulp 1 (403-62 1 )p) is able to (i) deconjugate large proteins and (ii) produce the mature forms of Smt3 
and SUMO. 

The secondary structure of Ulpl(403-621)p includes 7a helices and 70 strands. 
Ulp 1(403 -62 l)p exhibits structural similarities to other papain-like cysteine proteases in the active 
20 site, which includes the central a helix, 3 p strands, and the catalytic triad (Cy s-His-Glu/Asp) (Figure 

!)• 

The Ulp 1 (403-62 l)p-Smt3(13-98)p structure is currently refined to 1 .6 angstroms and 
includes all 219 amino acid residues of Ulp 1(403-62 l)p and 79 of the possible 87 amino acid 
residues of Smt3(13-98)p. Coordinates of the Ulpl(403-621)p-Smt3(13-98)p structure are shown 
25 in Table 1 (present before the claims). The interaction between Ulpl and Smt3 is described below 
in the context of six conserved Ulpl motifs. 

The interactions of motif 1 of Ulpl with Smt3 include side-chain to main-chain 
hydrogen bonding, Van der Waals (VDW) contacts, and one salt bridge between Arg438 of Ulpl 
and the Glu94 of Smt3. 
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Motif 2 of Ulpl is involved in the recognition of residues at the C terminus of Smt3, 
including VDW contacts with the Gly-Gly motif of Smt3 by Trp448. Motif 2 also contributes (i) 
3 hydrogen bonds to the main-chain of Smt3 strand 5 and (ii) 2 acidic residues (Glu455 and Asp45 1 ) 
that participate in salt-bridging interactions with 2 Smt3 basic residues (Arg64 and Arg71). 

Motif 3 of Ulpl provides direct hydrogen bonds to the main-chain of Smt3, Gly69, 
by a conserved Asn472. Hydrogen bonds are also made between Ser473 and Thr477 to an invariant 
Smt3 residue Gln95 in strand 5 . VDW contacts in the Smt3 -Ulp 1 interface is contributed by Phe474. 

Motif 4 of Ulpl contains salt bridging moieties (Arg489 and Arg493) that make 
contact with two acidic residues in Smt3 (Asp68 and Asp82). Positions 489 and 493 also produce 
hydrogen bonds through water to make interactions with Asp87 in Smt3 . Motif 4 contributes VDW 
contacts in the Ulpl-Smt3 interface through Trp490. 

Motif 5 of Ulpl contains His514, which is the general base of the catalytic triad. 
Motif 5 also participates in several interactions with the C-terminal tail of Smt3. Motif 5 contains 
Asn509 and Gln5 1 2, which participate in hydrogen bonds with main-chain atoms of Smt3 residues 
94 and 96. The carbonyl of Ser5 1 3 is involved in a main-chain to main-chain hydrogen bond with 
Smt3 Gly98. 

Motif 6 of Ulpl contains active site amino acid residues Cys580 and Gln574, which 
are involved in the function of the protein. 

The present invention is not to be limited in scope by the specific embodiments 
described herein. Indeed, various modifications of the invention in addition to those described 
herein will become apparent to those skilled in the art from the foregoing description and the 
accompanying figures. Such modifications are intended to fall within the scope of the appended 
claims. 

It is further to be understood that values are approximate, and are provided for 

description. 

Patents, patent applications, publications, procedures, and the like are cited 
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throughout this application, the disclosures of which are incorporated herein by reference in their 
entireties. 
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HYDROLASE 17-APR-00 1EUV 

X-RAY STRUCTURE OF THE C-TERMINAL ULP1 PROTEASE DOMAIN IN 

COMPLEX WITH SMT3, THE YEAST ORTHOLOG OF SUMO. 
MOL_ID: 1; 
MOLECULE: ULPl PROTEASE; 
CHAIN: A; 

FRAGMENT: C-TERMINAL PROTEASE DOMAIN; 
ENGINEERED: YES; 
MOL_ID: 2; 

MOLECULE: UBITQUTIN-LIKE PROTEIN SMT3; 
CHAIN: B; 

FRAGMENT: SMT3 RESIDUES 13-98; 
ENGINEERED: YES 
MOL_ID: 1; 

ORGANISM_SCIENTIFIC: SACCHAROMYCES CEREVISIAE; 
ORGAN I SM_COMMON : YEAST ; 
EXPRESSION_SYSTEM: ESCHERICHIA COLI; 
EXPRESSION_SYSTEM_COMMON: BACTERIA; 
EXPRESSION_SYSTEM_PLASMID: PET28B; 
MOL_ID: 2; 

ORGANISM_SCIENTIFIC: SACCHAROMYCES CEREVISIAE; 
ORGAN I SM_COMMON : YEAST ; 
EXPRESSION_SYSTEM: ESCHERICHIA COLI; 
EXPRESSION_SYSTEM_COMMON : BACTERIA; 
EXPRESSION_SYSTEM_PLASMID: PET28B 
SUMO HYDROLASE, UBIQUITIN-LIKE PROTEASE 1, SMT3 HYDROLASE 
DESUMOYLATING ENZYME, CYSTEINE PROTEASE, SUMO PROCESSING 
ENZYME, SMT3 PROCESSING ENZYME, NABH4, THIOHEMIACETAL, 
COVALENT PROTEASE ADDUCT 
X-RAY DIFFRACTION 
E . MOSSESSOVA, CD, LIMA 
07-JUN-00 1EUV 0 
AUTH E . MOSSESSOVA, CD. LIMA 

TITL ULP1-SUM0 CRYSTAL STRUCTURE AND GENETIC ANALYSIS 
TITL 2 REVEAL CONSERVED INTERACTIONS AND A REGULATORY 
TITL 3 ELEMENT ESSENTIAL FOR CELL GROWTH IN YEAST 
REF MOL. CELL V. 5 865 2000 

RE FN ASTM MOCEFL US ISSN 1097-2765 

REFERENCE 1 
AUTH S . J. LI , M . HOCHSTRASSER 

TITL A NEW PROTEASE REQUIRED FOR CELL-CYCLE PROGRESSION 
TITL 2 IN YEAST. 

REF NATURE V. 398 246 1999 

RE FN ASTM NATUAS UK ISSN 0028-0836 

RESOLUTION. 1.6 ANGSTROMS . 



REFINEMENT . 
PROGRAM 
AUTHORS 



REFMAC 

MURSHUDOV, VAGIN, DODSON 



DATA USED IN REFINEMENT . 
RESOLUTION RANGE HIGH (ANGSTROMS) 
RESOLUTION RANGE LOW (ANGSTROMS) 
DATA CUTOFF (SIGMA ( F) ) 

COMPLETENESS FOR RANGE (%) 
NUMBER OF REFLECTIONS 

FIT TO DATA USED IN REFINEMENT . 



1.60 
25.0 
1.000 
96. 6 
47560 
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REMARK 
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REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
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REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 



CROSS-VALIDATION METHOD 

FREE R VALUE TEST SET SELECTION 

R VALUE (WORKING + TEST SET) 

R VALUE (WORKING SET) 

FREE R VALUE 

FREE R VALUE TEST SET SIZE (%) 
FREE R VALUE TEST SET COUNT 

NUMBER OF NON-HYDROGEN ATOMS USED 
PROTEIN ATOMS : 2417 

NUCLEIC ACID ATOMS : 0 

HETEROGEN ATOMS : 0 

SOLVENT ATOMS : 4 32 



NULL 

5% OF THE OBSERVED 

DATA 

0.193 

0.190 

0.251 

NULL 

2378 

IN REFINEMENT. 



B VALUES. 
FROM WILSON PLOT 
MEAN B VALUE (OVERALL, 
OVERALL ANISOTROPIC B VALUE . 



A**2) 
A**2) 



20. 69 
NULL 



Bll 


(A**2) 


NULL 


B22 


(A**2) 


NULL 


B33 


(A**2) 


NULL 


B12 


(A**2) 


NULL 


B13 


(A**2) 


NULL 


B23 


(A**2) 


NULL 



ESTIMATED OVERALL COORDINATE ERROR. 
ESU BASED ON R VALUE 
ESU BASED ON FREE R VALUE 
ESU BASED ON MAXIMUM LIKELIHOOD 
ESU FOR B VALUES BASED ON MAXIMUM LIKELIHOOD 



(A) 
(A) 
(A) 
(A**2) 



NULL 
NULL 
NULL 
NULL 



RMS DEVIATIONS FROM IDEAL VALUES. 
DISTANCE RESTRAINTS. 

BOND LENGTH (A) 

ANGLE DISTANCE (A) 

INTRAPLANAR 1-4 DISTANCE (A) 

H-BOND OR METAL COORDINATION (A) 



PLANE RESTRAINT 
CHIRAL- CENTER RESTRAINT 



(A) 
(A**3) 



RMS 
0.013 
1.600 
NULL 
NULL 

NULL 
NULL 



SIGMA 

NULL 

NULL 

NULL 

NULL 

NULL 
NULL 



RESTRAINTS. 



NON-BONDED CONTACT 
SINGLE TORSION 
MULTIPLE TORSION 
H-BOND (X. . . Y) 
H-BOND (X-H. . . Y) 



CONFORMATIONAL TORSION ANGLE RESTRAINTS. 



(A) 


: NULL 


; NULL 


(A) 


: NULL 


; NULL 


(A) 


: NULL 


; NULL 


(A) 


: NULL 


; NULL 



SPECIFIED 
PLANAR 
STAGGERED 
TRANSVERSE 

ISOTROPIC THERMAL 
MAIN-CHAIN BOND 
MAIN-CHAIN ANGLE 
SIDE-CHAIN BOND 
SIDE-CHAIN ANGLE 



(DEGREES) 
(DEGREES) 
(DEGREES) 
(DEGREES) 

FACTOR RESTRAINTS. 

(A**2) 
(A**2) 
(A**2) 
(A**2) 



NULL 
NULL 
NULL 
NULL 



NULL 
NULL 
NULL 
NULL 



RMS 


SIGMA 


NULL 


; NULL 


NULL 


; NULL 


NULL 


; NULL 


NULL 


; NULL 



OTHER REFINEMENT REMARKS: USED A -LOGLIKELIHOOD RESIDUAL 
DERIVED FROM RICE DISTRIBUTION FOR CENTRIC AND ACENTRIC 
CASES OF FS SPARSE MATRIX PROCEDURE WITH ANISOTROPIC B- 
FACTOR REFINEMENT. 
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REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
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REMARK 
REMARK 
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4 
4 
6 
6 
6 
6 
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100 
100 
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200 
200 
200 
200 
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200 
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200 
200 
200 
200 
280 
280 
280 
280 
280 
280 



1EUV COMPLIES WITH FORMAT V. 2.3, 09-JULY-1998 

COVALENT ADDUCT FORMED BETWEEN THE PROTEOLYTIC 
ACTIVE SITE THIOL AND THE C-TERMINAL GLYCINE OF 
SMT3 USING THE REDUCING AGENT NABH4 . 

THIS ENTRY HAS BEEN PROCESSED BY RCSB ON 20-APR-2000 
THE RCSB ID CODE IS RCSB010911. 



EXPERIMENTAL DETAILS 
EXPERIMENT TYPE 
DATE OF DATA COLLECTION 
TEMPERATURE (KELVIN) 
PH 

NUMBER OF CRYSTALS USED 



(Y/N) 



(M/L) 
(A) 



SYNCHROTRON 
RADIATION SOURCE 
BEAMLINE 

X-RAY GENERATOR MODEL 
MONOCHROMATIC OR LAUE 
WAVELENGTH OR RANGE 
MONOCHROMATOR 
OPTICS 



DETECTOR TYPE 
DETECTOR MANUFACTURER 
INTENSITY-INTEGRATION SOFTWARE 
DATA SCALING SOFTWARE 

NUMBER OF UNIQUE REFLECTIONS 
RESOLUTION RANGE HIGH (A) 
RESOLUTION RANGE LOW (A) 
REJECTION CRITERIA (SIGMA(I)) 

OVERALL . 
COMPLETENESS FOR RANGE (%) 
DATA REDUNDANCY 

R MERGE CD 
R SYM (I) 
<I/SIGMA(I)> FOR THE DATA SET 



X-RAY DIFFRACTION 

21-OCT-1999 

100.0 

6.50 

1 

Y 

NSLS 
X4A 
NULL 
M 

0.9791 

NULL 

NULL 

CCD 

ADSC QUANTUM 4 

DENZO 

SCALE PACK 

47875 
1.600 
25.000 
0.000 



96.6 

9.400 

0.04300 

NULL 

16.4000 



,60 
.66 



IN THE HIGHEST RESOLUTION SHELL. 

HIGHEST RESOLUTION SHELL, RANGE HIGH (A) 

HIGHEST RESOLUTION SHELL, RANGE LOW (A) 

COMPLETENESS FOR SHELL (%) : 8 6.0 

DATA REDUNDANCY IN SHELL : 4.30 

R MERGE FOR SHELL (I) : 0.29000 

R SYM FOR SHELL (I) : NULL 

<I/SIGMA(I)> FOR SHELL : NULL 



DIFFRACTION PROTOCOL: SINGLE WAVELENGTH 
METHOD USED TO DETERMINE THE STRUCTURE: NULL 
SOFTWARE USED: SHARP 
STARTING MODEL: NULL 

REMARK: NULL 

CRYSTAL 

SOLVENT CONTENT, VS (%) : NULL 

MATTHEWS COEFFICIENT, VM ( ANGSTROMS * * 3 / DA) : NULL 
CRYSTALLIZATION CONDITIONS: 0 . 1M MES PH6.5, 10% W/V 
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REMARK 280 POLYETHYLENE GLYCOL 20000, 3% W/V 1, 6-HEXANDIOL 
REMARK 290 

REMARK 290 CRYSTALLOGRAPHIC SYMMETRY 

REMARK 290 SYMMETRY OPERATORS FOR SPACE GROUP: P 21 21 2 
REMARK 2 90 



SYMOP 
NNNMMM 
1555 
2555 
3555 
4555 

WHERE NNN 
MMM 



SYMMETRY 
OPERATOR 
X,Y,Z 
-X,-Y,Z 

l/2-X,l/2+Y,-Z 
1/2+X, l/2-Y,-Z 



REMARK 290 
REMARK 290 
REMARK 2 90 
REMARK 290 
REMARK 290 
REMARK 2 90 
REMARK 290 
REMARK 2 90 
REMARK 290 
REMARK 2 90 

REMARK 2 90 CRYSTALLOGRAPHIC SYMMETRY TRANSFORMATIONS 

REMARK 290 THE FOLLOWING TRANSFORMATIONS OPERATE ON THE ATOM/HETATM 
REMARK 290 RECORDS IN THIS ENTRY TO PRODUCE CRYSTALLOGRAPHICALLY 
REMARK 290 RELATED MOLECULES. 



OPERATOR NUMBER 
TRANSLATION VECTOR 
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REMARK 


290 


SMTRY2 


3 


0. 


.000000 


1. 


,000000 


0. 


,000000 
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290 SMTRY2 


4 0. 
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26. 


.58350 
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0. 
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REMARK 2 90 

REMARK 290 REMARK: NULL 
REMARK 300 

REMARK 300 BIOMOLECULE : 1 

REMARK 300 THIS ENTRY CONTAINS THE CRYSTALLOGRAPHIC ASYMMETRIC UNIT 
REMARK 300 WHICH CONSISTS OF 2 CHAIN (S). SEE REMARK 350 FOR 
REMARK 300 INFORMATION ON GENERATING THE BIOLOGICAL MOLECULE (S). 
REMARK 350 

REMARK 350 GENERATING THE BIOMOLECULE 

REMARK 350 COORDINATES FOR A COMPLETE MULTIMER REPRESENTING THE KNOWN 
REMARK 350 BIOLOGICALLY SIGNIFICANT OLIGOMERIZATION STATE OF THE 
REMARK 3 50 MOLECULE CAN BE GENERATED BY APPLYING BIOMT TRANSFORMATIONS 
REMARK 350 GIVEN BELOW. BOTH NON-CRYSTALLOGRAPHIC AND 
REMARK 350 CRYSTALLOGRAPHIC OPERATIONS ARE GIVEN. 
REMARK 350 

REMARK 350 BIOMOLECULE: 1 

REMARK 350 APPLY THE FOLLOWING TO CHAINS: A, B 

REMARK 350 BIOMT1 1 1.000000 0.000000 0.000000 0.00000 
REMARK 350 BIOMT2 1 0.000000 1.000000 0.000000 0.00000 
REMARK 350 BIOMT3 1 0.000000 0.000000 1.000000 0.00000 
REMARK 4 65 

REMARK 465 MISSING RESIDUES 

REMARK 4 65 THE FOLLOWING RESIDUES WERE NOT LOCATED IN THE 
REMARK 465 EXPERIMENT. (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN 
REMARK 465 IDENTIFIER; SSSEQ=SEQUENCE NUMBER; I=INSERTION CODE.) 
REMARK 4 65 

REMARK 4 65 M RES C SSSEQI 
REMARK 4 65 GLU B 13 

REMARK 4 65 VAL B 14 

REMARK 4 65 LYS B 15 

PRO B 16 
REMARK 4 65 GLU B 17 

REMARK 4 65 VAL B 18 
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REMARK 4 65 LYS B 19 

REMARK 4 70 

REMARK 470 MISSING ATOM 

REMARK 470 THE FOLLOWING RESIDUES HAVE MISSING ATOMS (M=MODEL NUMBER; 
REMARK 470 RES=RESIDUE NAME; C=CHAIN IDENTIFIER; SSEQ=SEQUENCE NUMBER; 
REMARK 470 I=INSERTION CODE) : 



REMARK 


470 


M RES 


CSSEQI 


ATOMS 








REMARK 


470 


PRO 


B 


20 


CB 


CG 


CD 




REMARK 


470 


GLU 


B 


21 


CB 


CG 


CD 


OE1 


REMARK 


470 


LYS 


B 


54 


CG 


CD 


CE 


N2 


REMARK 


470 


GLU 


B 


59 


CG 


CD 


OE1 


OE2 


REMARK 


470 


ASP 


B 


75 


CB 


CG 


OD1 


OD2 


REMARK 


470 


GLU 


B 


84 


CG 


CD 


OE1 


OE2 



REMARK 500 

REMARK 500 GEOMETRY AND STEREOCHEMISTRY 

REMARK 500 SUBTOPIC: CLOSE CONTACTS IN SAME ASYMMETRIC UNIT 
REMARK 500 

REMARK 500 THE FOLLOWING ATOMS ARE IN CLOSE CONTACT . 
REMARK 500 



REMARK 500 


ATM1 


RES C 


SSEQI 


ATM2 


RES C 


SSEQI 








REMARK 500 


NE 


ARG A 


422 


O 


HOH 


377 


1. 


12 




REMARK 500 


CD 


ARG A 


422 


O 


HOH 


377 


2 


.02 


REMARK 


500 O 


HOH 


272 


NE 


ARG A 


422 




2.14 






REMARK 500 


CZ 


ARG A 


422 


O 


HOH 


377 


2. 


17 




REMARK 500 


O 


HOH 


457 


O 


HOH 


718 


2 


.17 


REMARK 



^ 500 

Cp REMARK 500 GEOMETRY AND STEREOCHEMISTRY 

gt REMARK 500 SUBTOPIC: CLOSE CONTACTS 

;T?i REMARK 500 

~t: REMARK 500 THE FOLLOWING ATOMS THAT ARE RELATED BY CRYSTALLOGRAPHIC 

CI REMARK 500 SYMMETRY ARE IN CLOSE CONTACT. AN ATOM LOCATED WITHIN 0.15 

fy REMARK 500 ANGSTROMS OF A SYMMETRY RELATED ATOM IS ASSUMED TO BE ON A 

''I* REMARK 500 SPECIAL POSITION AND IS, THEREFORE, LISTED IN REMARK 375 

J* REMARK 500 INSTEAD OF REMARK 500. ATOMS WITH NON-BLANK ALTERNATE 

N REMARK 500 LOCATION INDICATORS ARE NOT INCLUDED IN THE CALCULATIONS, 

s REMARK 500 

p* REMARK 500 DISTANCE CUTOFF: 

fzl REMARK 500 2.2 ANGSTROMS FOR CONTACTS NOT INVOLVING HYDROGEN ATOMS 

L r i REMARK 500 1.6 ANGSTROMS FOR CONTACTS INVOLVING HYDROGEN ATOMS 

M REMARK 500 



REMARK 


500 


ATM1 


RES C 


SSEQI 


ATM 2 


RES C 


SSEQI 


SSYMOP 


DISTANCE 


REMARK 


500 


O 


HOH 


419 


O 


HOH 


4 


3558 


1.28 


REMARK 


500 


O 


HOH 


426 


O 


HOH 


4 


3558 


1.62 


REMARK 


500 


O 


HOH 


61 


O 


HOH 


473 


3558 


1.68 


REMARK 


500 


O 


HOH 


578 


O 


HOH 


49 


3548 


2.07 


REMARK 


500 


O 


HOH 


578 


O 


HOH 


21 


3548 


2.18 


REMARK 


500 


O 


HOH 


24 


O 


HOH 


473 


3558 


2.18 



REMARK 500 

REMARK 500 GEOMETRY AND STEREOCHEMISTRY 
REMARK 500 SUBTOPIC: COVALENT BOND LENGTHS 
REMARK 500 

REMARK 500 THE STEREOCHEMICAL PARAMETERS OF THE FOLLOWING RESIDUES 

REMARK 500 HAVE VALUES WHICH DEVIATE FROM EXPECTED VALUES BY MORE REMARK 500 THAN 6*RMSD 
(M^MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN 

REMARK 500 IDENTIFIER; SSEQ=SEQUENCE NUMBER; I=INSERTION CODE) . 
REMARK 500 

REMARK 500 STANDARD TABLE: 

REMARK 500 FORMAT: ( 10X, 13 , IX, 2 (A3 , IX, Al , 14 , Al , IX, A4 , 3X) , F6 . 3 ) 
REMARK 500 

REMARK 500 EXPECTED VALUES: ENGH AND HUBER, 1991 
REMARK 500 

REMARK 500 M RES CSSEQI ATMl RES CSSEQI ATM2 DEVIATION 
REMARK 500 ARG A 422 CB ARG A 422 CA -0.335 
REMARK 500 GLY B 98 O GLY B 98 C 0.125 
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REMARK 


500 


REMARK 


500 i 


REMARK 


500 : 


REMARK 


500 


REMARK 


500 ' 


REMARK 


500 ] 


REMARK 


500 ' 


REMARK 


500 


REMARK 


500 


REMARK 


500 ; 


REMARK 


500 


REMARK 


500 


REMARK 


500 ] 


REMARK 


500 


REMARK 


500 


REMARK 


500 


REMARK 


500 


REMARK 


500 


REMARK 


500 i 


REMARK 


500 ; 


REMARK 


500 


REMARK 


500 ' 


REMARK 


500 


REMARK 


500 , 


REMARK 


500 


REMARK 


500 


REMARK 


500 


REMARK 


500 


REMARK 


500 


REMARK 


500 


REMARK 


525 


REMARK 


525 ; 


REMARK 


525 ' 


REMARK 


525 


REMARK 


525 . 


RES=RESIDUE 



COVALENT BOND ANGLES 



RESIDUE NAME; C=CHAIN 
I=INSERTION CODE) . 



(10X, 13, IX, A3, IX, Al, I4,A1, 3 (1X,A4, 2X) , 12X, F5. 1) 



1991 



M RES CSSEQI ATM1 ATM2 ATM 3 

ARG A 422 CB - CA - C ANGL. DEV. 
GLU A 423 CB - CG - CD ANGL. DEV. 



- 21. 

- 21. 



DEGREES 
DEGREES 



I=INSERTION CODE) . 



M RES CSSEQI 
GLU A 423 



PSI 
112. 62 



PHI 
151.65 



:ATED WITH A SYMMETRY RELATED MOLECULE (M=MODEL REMARK 525 NUMBER; 

C=CHAIN IDENTIFIER; SSEQ=SEQUENCE REMARK 525 NUMBER; I=INSERTION CODE) : 



REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

DBREF 

DBREF 

SEQADV 

SEQADV 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 



525 
525 
525 
525 
525 
525 
525 
525 
525 
525 
999 
999 
999 
999 



M 


RES 


CSSEQI 












0 


HOH 


322 


DISTANCE 




6. 


,11 


ANGSTROMS 


0 


HOH 


420 


DISTANCE 




6. 


.85 


ANGSTROMS 


0 


HOH 


457 


DISTANCE 




7. 


.05 


ANGSTROMS 


0 


HOH 


544 


DISTANCE 




7 . 


.02 


ANGSTROMS 


0 


HOH 


627 


DISTANCE 




7. 


.46 


ANGSTROMS 


0 


HOH 


673 


DISTANCE 




8. 


,63 


ANGSTROMS 


0 


HOH 


718 


DISTANCE 




5. 


.47 


ANGSTROMS 


0 


HOH 


746 


DISTANCE 




5. 


,09 


ANGSTROMS 



SEQUENCE 
RESIDUES 
DENSITY . 



13-19 OF CHAIN B WERE NOT SEEN IN THE ELECTRON 



999 THE SIDE CHAINS OF RESIDUES 20, 21, 54, 59, 75, AND 84 
999 IN CHAIN B WERE NOT SEEN IN THE ELECTRON DENSITY. 



1EUV A 401 
1EUV B 13 
1EUV GLY A 
1EUV SER A 



621 
98 



GB 
SWS 



1039457 
Q12306 



AAB68167 
SMT3 YEAST 



401 
13 



621 
98 



221 
221 
221 
221 
221 
221 
221 



401 GB 1039457 LYS 

402 GB 1039457 LYS 
GLY SER LEU VAL PRO GLU 
GLN VAL GLN LYS ALA LEU 
LEU MET ASN ARG ASP ASN 
PHE 
ILE 



401 CLONING ARTIFACTS 

402 CLONING ARTIFACTS 
LEU ASN GLU LYS ASP ASP ASP 
ALA SER ARG GLU ASN THR GLN 
ILE GLU ILE THR VAL ARG ASP 

LYS THR LEU ALA PRO ARG ARG TRP LEU ASN ASP THR 
ILE GLU PHE PHE MET LYS TYR ILE GLU LYS SER THR 



PRO ASN THR VAL ALA PHE ASN SER PHE PHE TYR THR ASN 
LEU SER GLU ARG GLY TYR GLN GLY VAL ARG ARG TRP MET 
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SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

SEQRES 

FORMUL 

HELIX 

HELIX 

HELIX 

HELIX 

HELIX 

HELIX 

HELIX 

HELIX 

HELIX 

HELIX 

HELIX 

HELIX 

HELIX 

SHEET 

SHEET 

SHEET 

SHEET 

SHEET 

SHEET 

SHEET 

SHEET 

SHEET 

SHEET 

SHEET 

SHEET 

LINK 

CRYST1 

ORIGX1 

ORIGX2 

ORIGX3 

SCALE 1 

SCALE2 

SCALE3 

ATOM 

AN I SOU 

ATOM 

ANISOU 

ATOM 

ANISOU 

ATOM 

ANISOU 

ATOM 

ANISOU 

ATOM 

ANISOU 

ATOM 



8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
1 
2 
3 
4 
5 
6 
7 
3 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
1 
2 
1 
2 
3 
4 
5 
1 
2 
3 
4 
5 



A 221 LYS ARG LYS LYS THR GLN ILE ASP LYS LEU ASP LYS ILE 

A 221 PHE THR PRO ILE ASN LEU ASN GLN SER HIS TRP ALA LEU 

A 221 GLY ILE ILE ASP LEU LYS LYS LYS THR ILE GLY TYR VAL 

A 221 ASP SER LEU SER ASN GLY PRO ASN ALA MET SER PHE ALA 

A 221 ILE LEU THR ASP LEU GLN LYS TYR VAL MET GLU GLU SER 

A 221 LYS HIS THR ILE GLY GLU ASP PHE ASP LEU ILE HIS LEU 

A 221 ASP CYS PRO GLN GLN PRO ASN GLY TYR ASP CYS GLY ILE 

A 221 TYR VAL CYS MET ASN THR LEU TYR GLY SER ALA ASP ALA 

A 221 PRO LEU ASP PHE ASP TYR LYS ASP ALA ILE ARG MET ARG 

A 221 ARG PHE ILE ALA HIS LEU ILE LEU THR ASP ALA LEU LYS 

B 8 6 GLU VAL LYS PRO GLU VAL LYS PRO GLU THR HIS ILE ASN 

B 86 LEU LYS VAL SER ASP GLY SER SER GLU ILE PHE PHE LYS 

B 8 6 ILE LYS LYS THR THR PRO LEU ARG ARG LEU MET GLU ALA 

B 8 6 PHE ALA LYS ARG GLN GLY LYS GLU MET ASP SER LEU ARG 

B 8 6 PHE LEU TYR ASP GLY ILE ARG ILE GLN ALA ASP GLN THR 

B 8 6 PRO GLU ASP LEU ASP MET GLU ASP ASN ASP ILE ILE GLU 

B 8 6 ALA HIS ARG GLU GLN ILE GLY GLY 

HOH M32(H2 01) 



A 
A 
B 
B 
B 
B 
B 
C 

c 
c 
c 
c 

SG 

125.774 



ASN 
VAL 
THR 
ASN 
SER 
TYR 
MET 

8 GLN 

9 ASN 

10 ASP 

11 ASP 

12 LEU B 

13 GLU B 
2 
2 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 



GLN 
ILE 
THR 
LYS 
HIS 
THR 
ASP 
GLU B 
HIS B 
ASP B 
LEU B 
ILE B 
CYS A 



408 

437 

442 

450 

473 

484 

491 

497 

538 

579 

600 
45 
59 
426 
433 
468 
503 
514 
526 
565 
34 
23 
87 
63 
70 
580 



A 
A 
A 
A 



ALA 
LYS 
ALA 
THR 
GLY A 
VAL A 
LYS 
LEU 
SER 
ALA 
THR 
GLN B 
ASP B 

ARG 

THR 

ALA 

LEU 

ASP 

VAL 

LEU 

LYS B 

SER B 
. ARG B 

TYR B 

ARG B 



A 
A 
A 
A 
A 



420 
441 
444 
465 
483 
487 
495 
501 
556 
593 
617 
56 
61 
430 
436 
470 
510 
521 
530 
569 
40 
29 
93 
67 
71 



ILE A 433 N ARG A 430 



O 
O 

o 

0 



53.167 54.262 90.00 
1.000000 0.000000 0.000000 
0.000000 1.000000 
0.000000 0.000000 
0.007950 0.000000 
0.000000 0.018810 
0.000000 0.000000 

GLY A 401 



N 
N 

CA 

CA 

C 

C 

O 

O 

N 

N 

CA 
CA 
C 



GLY 
GLY 
GLY 
GLY 
GLY 
GLY 
GLY 
SER 
SER 
SER 
SER 
SER 



401 
401 
401 
401 
401 
401 
401 
402 
402 
402 
402 
402 



LYS A 503 N VAL A 4 69 

HIS A 514 N LEU A 510 

THR A 526 N ASP A 521 

ASP A 565 N ILE A 527 

N ILE B 24 O ILE B 39 

O ASP B 87 N LYS B 27 

N ARG B 64 O HIS B 92 

O ILE B 70 N TYR B 67 

C GLY B 98 

90.00 90.00 P 21 21 2 4 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
16.972 92.970 1.00 40.83 

5054 34 -142 134 

.389 1.00 40.16 

86 -131 93 
.429 1.00 39.45 

96 -84 29 



0.000000 
1.000000 
0.000000 
0.000000 
0.018430 
33. 908 
5308 5152 

34.959 17.863 92 
5199 5101 4959 

34.276 18.853 91 
5132 5117 4741 



33.765 19.898 91.836 1.00 41.56 



5289 



5159 



5342 



113 



-1 



34.308 18.564 90.144 1.00 36.00 
4567 4596 4517 206 -292 237 

33.593 19.410 89.176 1.00 31.44 
4023 3888 4035 -16 -105 48 



13 
5 
3 
16 
11 
4 
5 
5 
19 
15 
18 
12 
3 



33,490 18.568 87.917 1.00 28.88 
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1=4 



AN I SOU 


7 


C 


SER 


A 


402 


3621 3603 


3750 


118 




1 67 




272 


ATOM 


8 


O 


SER 


A 


402 


34.398 17 


833 87 


545 1 


00 


30 , 


31 




AN I SOU 


8 


O 


SER 


A 


402 


3857 3763 


3896 


281 




148 




21o 


ATOM 


9 


CB 


SER 


A 


402 


34.236 20 


759 88. 


859 1 


00 


~i A 

30 . 


C A 

60 




AN I SOU 


9 


CB 


SER 


A 


402 


3984 3744 


3898 


1 94 




217 




1 95 


ATOM 


10 


OG 


SER 


A 


402 


33.428 21 


458 87 


901 1 


00 


28 . 


4 6 




AN I SOU 


10 


OG 


SER 


A 


402 


3763 3367 


3684 


85 




148 




70 


ATOM 


11 


N 


LEU 


A 


403 


32.358 18 


715 87, 


232 1 


00 


24 , 


4 8 




AN I SOU 


11 


N 


LEU 


A 


403 


3228 2729 


3344 


20 




-68 




91 


ATOM 


12 


CA 


LEU 


A 


403 


32.156 18 


045 85. 


960 1 


00 


22 . 


66 




AN I SOU 


12 


CA 


LEU 


A 


403 


2907 2577 


3127 


65 




11 




223 


ATOM 


13 


C 


LEU 


A 


403 


32.648 18 


911 84, 


813 1 


00 


21 


40 




AN I SOU 


13 


C 


LEU 


A 


403 


2696 2438 


2996 


85 




-85 




152 


ATOM 


14 


O 


LEU 


A 


403 


32.533 18 


491 83 


671 1 


00 


21 


43 




AN I SOU 


14 


O 


LEU 


A 


403 


3033 2035 


3075 


125 




44 




179 


ATOM 


15 


CB 


LEU 


A 


403 


30.654 17 


761 85 


763 1 


00 


21 


94 




AN I SOU 


15 


CB 


LEU 


A 


403 


2366 2410 


3000 


15 




85 




214 


ATOM 


16 


CG 


LEU 


A 


403 


30.070 16 


687 86 


706 1 


00 


22 


13 




AN I SOU 


16 


CG 


LEU 


A 


403 


2787 2618 


3003 


68 




171 




266 


ATOM 


17 


CD1 


LEU 


A 


403 


28.607 16 


437 86 


413 1 


00 


23 


95 




AN I SOU 


17 


CD1 


LEU 


A 


403 


3074 2778 


3249 


-32 




-70 




20 


ATOM 


18 


CD2 


LEU 


A 


403 


30.897 15 


404 86 


513 1 


00 


23 . 


40 




AN I SOU 


18 


CD2 


LEU 


A 


403 


3198 2409 


3283 


-36 




69 




60 


ATOM 


19 


N 


VAL 


A 


404 


33.143 20 


116 85 


092 1 


00 


22 


43 




AN I SOU 


19 


N 


VAL 


A 


404 


2884 2486 


3153 


114 


— 


106 




152 


ATOM 


20 


CA 


VAL 


A 


404 


33.683 20 


963 84 


021 1 


00 


22 


06 




AN I SOU 


20 


CA 


VAL 


A 


404 


2880 2453 


3050 


80 




-22 




91 


ATOM 


21 


C 


VAL 


A 


404 


35.197 20 


906 84 


138 1 


00 


22 


63 




AN I SOU 


21 


C 


VAL 


A 


404 


3081 2515 


3001 


138 




108 




249 


ATOM 


22 


O 


VAL 


A 


404 


35.785 21 


433 85 


067 1 


00 


24 


07 




ANISOU 


22 


O 


VAL 


A 


404 


2846 3020 


3279 


128 


— 


221 




178 


ATOM 


23 


CB 


VAL 


A 


404 


33.147 22 


416 84 


244 1 


00 


21 


98 




ANISOU 


23 


CB 


VAL 


A 


404 


2895 2505 


2951 


158 




-13 




131 


ATOM 


24 


CGI 


VAL 


A 


404 


33.678 23 


330 83 


137 1 


00 


21 


57 




ANISOU 


24 


CGI 


VAL 


A 


404 


2633 2638 


2926 


59 




-25 




132 


ATOM 


25 


CG2 


VAL 


A 


404 


31.636 22 


469 84 


333 1 


00 


20 


47 




ANISOU 


25 


CG2 


VAL 


A 


404 


2813 2171 


2794 


-47 




-76 




44 


ATOM 


26 


N 


PRO 


A 


405 


35.860 20 


317 83 


172 1 


00 


23 


66 




ANISOU 


26 


N 


PRO 


A 


405 


3008 2745 


3235 


89 




31 




168 


ATOM 


27 


CA 


PRO 


A 


405 


37.313 20 


171 83 


266 1 


00 


26 


77 




ANISOU 


27 


CA 


PRO 


A 


405 


3176 3461 


3535 


116 




-32 




99 


ATOM 


28 


C 


PRO 


A 


405 


38.073 21 


419 82 


867 1 


00 


30 


16 




ANISOU 


28 


C 


PRO 


A 


405 


3742 3725 


3992 


-99 




-6 




186 


ATOM 


29 


O 


PRO 


A 


405 


37.569 22 


254 82 


115 1 


00 


31 


05 




ANISOU 


29 


O 


PRO 


A 


405 


3572 4012 


4213 


133 




-42 




218 


ATOM 


30 


CB 


PRO 


A 


405 


37.602 19 


013 82 


310 1 


00 


27 


12 




AN I SOU 


30 


CB 


PRO 


A 


405 


3320 3480 


3506 


35 




-40 




78 


ATOM 


31 


CG 


PRO 


A 


405 


36.575 19 


165 81 


253 1 


00 


26 


08 




ANISOU 


31 


CG 


PRO 


A 


405 


3445 3222 


3241 


7 6 




27 




100 


ATOM 


32 


CD 


PRO 


A 


405 


35.322 19 


665 81 


962 1 


00 


24 


17 




ANISOU 


32 


CD 


PRO 


A 


405 


3120 2926 


3135 


87 




-29 




248 


ATOM 


33 


N 


GLU 


A 


406 


39.331 21 


452 83 


308 1 


00 


32 


15 




ANISOU 


33 


N 


GLU 


A 


406 


3743 4267 


4205 


62 




-o2 




0 4 


ATOM 


34 


CA 


GLU 


A 


406 


40.213 22 


54 6 82 


896 1 


00 


34 


4 1 




ANISOU 


34 


CA 


GLU 


A 


406 


4328 4297 


4449 


-26 




60 




111 


ATOM 


35 


C 


GLU 


A 


406 


40.962 22 


009 81 


678 1 


00 


33 


20 




ANISOU 


35 


C 


GLU 


A 


406 


4044 4240 


4329 


-11 




-78 




122 


ATOM 


36 


O 


GLU 


A 


406 


41.302 20 


813 81 


564 1 


00 


34 


60 




ANISOU 


36 


O 


GLU 


A 


406 


4388 4326 


4431 


23 




75 




21 


ATOM 


37 


CB 


GLU 


A 


406 


41.151 22 


996 84 


009 1 


00 


38 
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ATOM 1035 CE LYS A 524 15.909 12.024 89.102 1.00 34.85 C 

ANISOU 1035 CE LYS A 524 4336 4517 4389 -39 -25 -51 C 

ATOM 1036 NZ LYS A 524 16.514 11.646 87.793 1.00 34.69 N 

AN I SOU 1036 NZ LYS A 524 4213 4629 4338 -165 53 97 N 

ATOM 1037 N LYS A 525 9.092 11.343 86.867 1.00 24.54 N 

ANISOU 1037 N LYS A 525 3232 2630 3462 -213 127 220 N 

ATOM 1038 CA LYS A 525 7.767 11.927 86.682 1.00 25.62 C 
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ANISOU 1044 CE LYS A 525 4224 3964 4260 -113 121 -40 C 

ATOM 1045 NZ LYS A 525 4.799 7.982 84.634 1.00 32.11 N 
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L ATOM 1053 N ILE A 527 8.557 17.689 86.045 1.00 26.15 N 

O ANISOU 1053 N ILE A 527 3520 2973 3442 -183 293 93 N 

LH ATOM 1054 CA ILE A 527 8.913 18,681 85.013 1.00 25.72 C 
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ATOM 1062 CA GLY A 528 11.439 21.235 86.200 1.00 22.81 C 

ANISOU 1062 CA GLY A 528 3073 2565 3030 -163 179 105 C 

ATOM 1063 C GLY A 528 11.870 22.408 85.327 1.00 21.82 C 

ANISOU 1063 C GLY A 528 2935 2571 2784 -210 302 13 C 

ATOM 1064 O GLY A 528 12.145 22.227 84.130 1.00 21.67 O 

ANISOU 1064 O GLY A 528 3434 2090 2709 -316 324 118 O 

ATOM 1065 N TYR A 529 11.827 23.570 85.943 1.00 19.07 N 

ANISOU 1065 N TYR A 529 2557 2264 2423 -228 432 288 N 

ATOM 1066 CA TYR A 529 12.249 24.815 85.244 1,00 19.32 C 

ANISOU 1066 CA TYR A 529 2534 2140 2667 -71 173 302 C 
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ATOM 1067 C TYR A 529 13.436 25.315 86.040 1.00 20.01 C 

ANISOU 1067 C TYR A 529 2649 2308 2646 -15 174 67 C 

ATOM 1068 0 TYR A 529 13.318 25.491 87.260 1.00 20.14 0 

ANISOU 1068 0 TYR A 529 2515 2533 2604 -316 392 139 0 

ATOM 1069 CB TYR A 529 11.101 25.844 85.232 1.00 19.92 C 

ANISOU 1069 CB TYR A 529 2567 2258 2741 -13 180 66 C 

ATOM 1070 CG TYR A 529 11.593 27.177 84.663 1.00 19.55 C 

ANISOU 1070 CG TYR A 529 2497 2303 2629 -87 180 65 C 

ATOM 1071 CD1 TYR A 529 11.836 27.317 83.300 1.00 19.22 C 

ANISOU 1071 CD1 TYR A 529 2345 2387 2573 -129 118 52 C 

ATOM 1072 CD2 TYR A 529 11.825 28.242 85.498 1.00 21.21 C 

ANISOU 1072 CD2 TYR A 529 2756 2502 2800 -136 192 -9 C 

ATOM 1073 CE1 TYR A 529 12.275 28.523 82.766 1.00 19.18 C 

ANISOU 1073 CE1 TYR A 529 2320 2392 2573 -195 69 -36 C 

ATOM 1074 CE2 TYR A 529 12.292 29.454 84.977 1.00 20.66 C 

ANISOU 1074 CE2 TYR A 529 2508 2470 2873 -167 199 44 C 

ATOM 1075 CZ TYR A 529 12.478 29.588 83.629 1.00 19.62 C 

ANISOU 1075 CZ TYR A 529 2545 2147 2763 -154 133 -1 C 

ATOM 1076 OH TYR A 529 12.986 30.765 83.079 1.00 20.48 O 

ANISOU 1076 OH TYR A 529 2570 2316 2897 -321 401 29 O 

ATOM 1077 N VAL A 530 14.599 25.509 85.419 1.00 19.71 N 

ANISOU 1077 N VAL A 530 2583 2184 2721 -161 244 215 N 

ATOM 1078 CA VAL A 530 15.818 25.864 86.151 1.00 19.51 C 

ANISOU 1078 CA VAL A 530 2619 2202 2591 -49 261 158 C 

ATOM 1079 C VAL A 530 16.382 27.118 85.501 1.00 18.62 C 

Q ANISOU 1079 C VAL A 530 2445 2200 2429 -134 225 21 C 

^ ATOM 1080 O VAL A 530 16.599 27.123 84.306 1.00 19.34 O 

5 ANISOU 1080 O VAL A 530 2687 2294 2367 -271 166 61 O 
^ ATOM 1081 CB VAL A 530 16.807 24,679 86.249 1.00 21.91 C 
H! ANISOU 1081 CB VAL A 530 2857 2345 3124 -36 187 52 C 
m ATOM 1082 CGI VAL A 530 17.413 24.267 84.933 1.00 24.00 C 
~! ANISOU 1082 CGI VAL A 530 3085 2880 3154 43 244 34 C 

ATOM 1083 CG2 VAL A 530 17.857 24.947 87.297 1.00 22.89 C 

JS ANISOU 1083 CG2 VAL A 530 2902 2691 3104 182 80 1 C 

■■it ATOM 1084 N ASP A 531 16.541 28.148 86.313 1.00 18.01 N 

ANISOU 1084 N ASP A 531 2426 1966 2452 -183 389 124 N 

6 ATOM 1085 CA ASP A 531 16.922 29.466 85.757 1.00 17.65 C 
p ANISOU 1085 CA ASP A 531 2484 1877 2344 -179 246 29 C 
||f ATOM 1086 C ASP A 531 18.005 30.005 86.684 1.00 17.93 C 

ANISOU 1086 C ASP A 531 2407 2089 2317 -161 142 112 C 

-" P ATOM 1087 O ASP A 531 17.778 30.190 87.888 1.00 19.21 O 

SI ANISOU 1087 O ASP A 531 2771 2146 2384 -125 233 -33 O 

Q ATOM 1088 CB ASP A 531 15.637 30.285 85.686 1.00 18.96 C 

T ANISOU 1088 CB ASP A 531 2561 2170 2472 -54 84 46 C 

^ ATOM 1089 CG ASP A 531 15.766 31.672 85.084 1.00 19.66 C 

ANISOU 1089 CG ASP A 531 2837 2185 2448 25 86 15 C 

ATOM 1090 OD1 ASP A 531 16.844 32.274 85.205 1.00 20.25 O 

ANISOU 1090 OD1 ASP A 531 2842 2265 2585 2 151 94 O 

ATOM 1091 OD2 ASP A 531 14.731 32.150 84.536 1.00 19.95 O 

ANISOU 1091 OD2 ASP A 531 2846 2166 2567 -31 43 240 O 

ATOM 1092 N SER A 532 19.120 30.473 86.080 1.00 17.40 N 

ANISOU 1092 N SER A 532 2264 1902 2446 -72 297 -76 N 

ATOM 1093 CA SER A 532 20.242 30.997 86.832 1.00 18.39 C 

ANISOU 1093 CA SER A 532 2611 2139 2239 65 64 -49 C 

ATOM 1094 C SER A 532 20.136 32.508 87.153 1.00 18.84 C 

ANISOU 1094 C SER A 532 2556 2203 2400 -177 156 -105 C 

ATOM 1095 O SER A 532 21.073 33.027 87.750 1.00 18.74 O 

ANISOU 1095 O SER A 532 2671 2009 2441 -196 152 -251 O 

ATOM 1096 CB SER A 532 21.592 30.748 86.133 1.00 17.93 C 

ANISOU 1096 CB SER A 532 2371 2432 2008 -23 14 97 C 

ATOM 1097 OG SER A 532 21.558 31.336 84.814 1.00 17.00 O 

ANISOU 1097 OG SER A 532 2459 2032 1970 45 31 69 O 

ATOM 1098 N LEU A 533 19.032 33.127 86.821 1.00 19.85 N 

ANISOU 1098 N LEU A 533 2745 2151 2645 -51 165 -141 N 
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ATOM 1099 CA LEU A 533 18.818 34.515 87.294 1.00 21.71 C 

ANISOU 1099 CA LEU A 533 3000 2246 3002 103 143 -166 C 

ATOM 1100 C LEU A 533 17.338 34.623 87.615 1.00 23.14 C 

ANISOU 1100 C LEU A 533 2965 2634 3193 -16 25 -200 C 

ATOM 1101 O LEU A 533 16.559 35.275 86.940 1.00 24.86 O 

ANISOU 1101 O LEU A 533 3149 3073 3224 203 -39 -264 O 

ATOM 1102 CB LEU A 533 19.395 35.556 86.329 1.00 24.34 C 

ANISOU 1102 CB LEU A 533 3307 2753 3186 -43 112 53 C 

ATOM 1103 CG LEU A 533 19.592 36.950 86.969 1.00 27.67 C 

ANISOU 1103 CG LEU A 533 3878 3059 3577 -9 -7 -174 C 

ATOM 1104 CD1 LEU A 533 20.617 36.899 88.082 1.00 28.13 C 

ANISOU 1104 CD1 LEU A 533 3716 3229 3745 -53 -34 -148 C 

ATOM 1105 CD2 LEU A 533 20.033 37.930 85.892 1.00 29.92 C 

ANISOU 1105 CD2 LEU A 533 3970 3511 3887 -111 88 17 C 

ATOM 1106 N SER A 534 16.952 33.948 88.691 1.00 25.11 N 

ANISOU 1106 N SER A 534 3066 3104 3372 7 133 -128 N 

ATOM 1107 CA SERA 534 15.539 33.920 89.094 1.00 29.12 C 

ANISOU 1107 CA SER A 534 3196 3881 3988 -80 190 9 C 

ATOM 1108 C SER A 534 15.349 34.300 90.550 1.00 33.77 C 

ANISOU 1108 C SER A 534 4280 4419 4134 -194 87 -119 C 

ATOM 1109 O SER A 534 16.094 33.806 91.401 1.00 34.44 O 

ANISOU 1109 O SER A 534 4366 4668 4052 -165 174 -72 O 

ATOM 1110 CB SER A 534 15.093 32.458 88.916 1.00 29.92 C 

ANISOU 1110 CB SER A 534 3380 3929 4061 -138 236 -206 C 

^ ATOM 1111 OG SER A 534 13.809 32.243 89.499 1.00 30.70 O 

Q ANISOU 1111 OG SER A 534 3575 3946 4142 -250 236 -478 O 

%9 ATOM 1112 N ASN A 535 14.266 35.000 90.853 1.00 38.52 N 

ANISOU 1112 N ASN A 535 4635 4858 5144 -4 91 -59 N 

5^ ATOM 1113 CA ASN A 535 13.945 35.281 92.263 1.00 42.88 C 

U'l ANISOU 1113 CA ASN A 535 5476 5575 5240 -52 104 -85 C 

pi ATOM 1114 C ASN A 535 12.969 34.223 92.783 1.00 43.94 C 

jpjj : ANISOU 1114 C ASN A 535 5602 5574 5519 -83 98 -69 C 

ATOM 1115 O ASN A 535 12.632 34.188 93.981 1.00 45.16 O 

4* ANISOU 1115 O ASN A 535 5794 5834 5529 -62 126 -57 O 

%| ATOM 1116 CB ASN A 535 13.426 36.693 92.496 1.00 45.09 C 

ANISOU 1116 CB ASN A 535 5725 5679 5729 67 19 -85 C 

ATOM 1117 CG ASN A 535 14.484 37.713 92.100 1.00 46.65 C 

W ANISOU 1117 CG ASN A 535 5922 5870 5934 -51 50 -47 C 

tfl ATOM 1118 OD1 ASN A 535 15.647 37.604 92.509 1.00 47.97 O 

ANISOU 1118 OD1 ASN A 535 6025 6115 6085 -9 -45 -24 O 

ATOM 1119 ND2 ASN A 535 14.074 38.675 91.291 1.00 47.55 N 

ANISOU 1119 ND2 ASN A 535 6070 5977 6019 -7 -34 -26 N 

tj ATOM 1120 N GLY A 536 12.547 33.322 91.888 1.00 43.16 N 

Li ANISOU 1120 N GLY A 536 5433 5605 5360 -84 167 -75 N 

^ ATOM 1121 CA GLY A 536 11.720 32.219 92.314 1.00 42.57 C 

ANISOU 1121 CA GLY A 536 5433 5440 5300 -46 75 -137 C 

ATOM 1122 C GLY A 536 10.448 32.026 91.523 1.00 41.44 C 

ANISOU 1122 C GLY A 536 5306 5268 5172 28 180 -107 C 

ATOM 1123 O GLY A 536 10,131 32.713 90.550 1.00 39.62 O 

ANISOU 1123 O GLY A 536 4959 4833 5261 114 206 -212 O 

ATOM 1124 N PRO A 537 9.723 30.984 91.954 1.00 41.72 N 

ANISOU 1124 N PRO A 537 5401 5191 5259 -40 70 -135 N 

ATOM 1125 CA PRO A 537 8.483 30.615 91.290 1.00 41.96 C 

ANISOU 1125 CA PRO A 537 5303 5353 5288 -2 71 -64 C 

ATOM 1126 C PRO A 537 7.735 31.886 90.963 1.00 42,20 C 

ANISOU 1126 C PRO A 537 5449 5319 5267 -37 26 13 C 

ATOM 1127 O PRO A 537 7.541 32.752 91.840 1.00 43.58 O 

ANISOU 1127 O PRO A 537 5645 5509 5405 -72 73 -119 O 

ATOM 1128 CB PRO A 537 7.775 29.709 92.267 1.00 41.74 C 

ANISOU 1128 CB PRO A 537 5338 5221 5301 -24 57 -95 C 

ATOM 1129 CG PRO A 537 8.837 29.154 93.161 1.00 42.31 C 

ANISOU 1129 CG PRO A 537 5387 5305 5385 -51 26 -60 C 

ATOM 1130 CD PRO A 537 10.021 30.077 93.079 1.00 41.98 C 

ANISOU 1130 CD PRO A 537 5279 5392 5279 -36 56 -94 C 
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ATOM 1163 CG PHE A 542 3.352 28.591 87.485 1.00 35.58 C 

ANISOU 1163 CG PHE A 542 4335 4609 4575 -27 50 -3 C 

ATOM 1164 CD1 PHE A 542 3.521 29.567 88.444 1.00 37.40 C 

ANISOU 1164 CD1 PHE A 542 4758 4693 4760 -87 33 -70 C 

ATOM 1165 CD2 PHE A 542 3.359 27.251 87.839 1.00 37.35 C 

ANISOU 1165 CD2 PHE A 542 4793 4650 4749 22 48 -16 C 

ATOM 1166 CE1 PHE A 542 3.724 29.213 89.767 1.00 38.62 C 

ANISOU 1166 CE1 PHE A 542 4869 4968 4838 31 -6 49 C 

ATOM 1167 CE2 PHE A 542 3.576 26.892 89.156 1.00 38.49 C 

ANISOU 1167 CE2 PHE A 542 4888 4957 4780 -50 10 15 C 

, ATOM 1168 CZ PHE A 542 3.739 27.878 90.110 1.00 38.82 C 

ANISOU 1168 CZ PHE A 542 4945 4898 4908 -47 12 -23 C 

ATOM 1169 N ALA A 543 3.679 29.131 82.988 1.00 27.61 N 

ANISOU 1169 N ALA A 543 3063 3463 3966 -24 176 -153 N 

ATOM 1170 CA ALA A 543 3.312 28.898 81.608 1.00 28.18 C 

ANISOU 1170 CA ALA A 543 3316 3458 3934 62 109 -32 C 

ATOM 1171 C ALA A 543 4.315 27,934 80.978 1.00 27.24 C 

ANISOU 1171 C ALA A 543 3100 3419 3832 -39 -43 -41 C 

ATOM 1172 O ALA A 543 3.891 27.090 80.197 1.00 27.41 O 

ANISOU 1172 O ALA A 543 3093 3300 4021 -91 43 -34 O 

ATOM 1173 CB ALA A 543 3.207 30.162 80.775 1.00 30.56 C 

ANISOU 1173 CB ALA A 543 3874 3596 4141 47 28 63 C 

ATOM 1174 N ILE A 544 5.619 28.080 81.245 1.00 24.40 N 

ANISOU 1174 N ILE A 544 2987 2720 3564 -110 26 0 N 

ATOM 1175 CA ILE A 544 6.597 27.158 80.714 1.00 23.25 C 

0 ANISOU 1175 CA ILE A 544 2809 2822 3203 -109 -13 83 C 

,f* ATOM 1176 C ILE A 544 6.320 25.756 81.267 1.00 21.89 C 

ANISOU 1176 C ILE A 544 2410 2778 3129 -13 80 44 C 

Ul ATOM 1177 O ILE A 544 6.329 24.846 80.452 1.00 22.55 O 

tjl ANISOU 1177 O ILE A 544 2457 2733 3378 -308 59 30 O 

m ATOM 1178 CB ILE A 544 8.035 27.606 81.064 1.00 23.69 C 

5; ANISOU 1178 CB ILE A 544 2751 2967 3281 -29 41 77 C 

1%I ATOM 1179 CGI ILE A 544 8.336 28.899 80.280 1.00 24.17 C 

ANISOU 1179 CGI ILE A 544 2814 2955 3412 -190 -99 71 C 

\l ATOM 1180 CG2 ILE A 544 9.080 26.559 80.732 1.00 23.76 C 

'* ANISOU 1180 CG2 ILE A 544 2973 2883 3173 0 -53 -30 C 

ATOM 1181 CD1 ILE A 544 9.606 29.597 80.759 1.00 24.04 C 

p ANISOU 1181 CD1 ILE A 544 2749 2965 3420 -194 53 92 C 

[A ATOM 1182 N LEU A 545 6.202 25.602 82.589 1.00 23.24 N 

7l ANISOU 1182 N LEU A 545 2815 2786 3229 -90 47 105 N 

p ! ATOM 1183 CA LEU A 545 5.971 24.239 83.093 1.00 22.70 C 

%J ANISOU 1183 CA LEU A 545 2755 2763 3107 83 147 97 C 

PS ATOM 1184 C LEU A 545 4.719 23.633 82.517 1.00 23.03 C 

rT ANISOU 1184 C LEU A 545 2696 2820 3235 55 137 38 C 

p? ATOM 1185 O LEU A 545 4.740 22.456 82.099 1.00 22.91 O 

ANISOU 1185 O LEU A 545 2514 2725 3464 -50 277 67 O 

ATOM 1186 CB LEU A 545 5.975 24.256 84.626 1.00 23.22 C 

ANISOU 1186 CB LEU A 545 2832 2865 3127 17 -2 -49 C 

ATOM 1187 CG LEU A 545 7.316 24.654 85.247 1.00 23.44 C 

ANISOU 1187 CG LEU A 545 2968 2825 3114 -155 27 -19 C 

ATOM 1188 CD1 LEU A 545 7.091 25.039 86.694 1.00 25.32 C 

ANISOU 1188 CD1 LEU A 545 3398 3041 3181 -46 89 24 C 

ATOM 1189 CD2 LEU A 545 8.294 23.497 85.144 1.00 23.99 C 

ANISOU 1189 CD2 LEU A 545 3040 2933 3144 -124 71 -113 C 

ATOM 1190 N THR A 546 3.585 24.400 82.423 1.00 23.28 N 

ANISOU 1190 N THR A 546 2713 2810 3322 57 151 281 N 

ATOM 1191 CA THR A 546 2.410 23.752 81.835 1.00 23.36 C 

ANISOU 1191 CA THR A 546 2805 2868 3203 107 153 104 C 

ATOM 1192 C THR A 546 2.547 23.433 80.360 1.00 23.93 C 

ANISOU 1192 C THR A 546 2851 2892 3350 101 219 74 C 

ATOM 1193 O THR A 546 2.063 22.396 79.881 1.00 25.26 O 

ANISOU 1193 O THR A 546 2518 3265 3816 -11 308 -239 O 

ATOM 1194 CB THR A 54 6 1.134 24.602 82.122 1.00 23.80 C 

ANISOU 1194 CB THR A 546 2923 2874 3245 272 152 110 C 
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ATOM 1227 CG LYS A 550 -0.155 20.713 75.941 1.00 31.76 C 

ANISOU 1227 CG LYS A 550 4057 4034 3977 -202 -152 -12 C 

ATOM 1228 CD LYS A 550 -0.798 22.002 75.495 1.00 34.48 C 

ANISOU 1228 CD LYS A 550 4395 4240 4465 -27 -84 67 C 

ATOM 1229 CE LYS A 550 0.218 23.051 75.036 1.00 36.00 C 

ANISOU 1229 CE LYS A 550 4448 4437 4793 -124 -73 91 C 

ATOM 1230 NZ LYS A 550 -0.506 24.249 74.503 1.00 36.21 N 

ANISOU 1230 NZ LYS A 550 4398 4531 4830 -109 -120 125 N 

ATOM 1231 N TYR A 551 3.145 19.062 76.396 1.00 22.08 N 

ANISOU 1231 N TYR A 551 2503 2664 3225 -274 86 195 N 

ATOM 1232 CA TYR A 551 3.909 18.345 75.403 1.00 20.84 C 

ANISOU 1232 CA TYR A 551 2400 2545 2972 -330 -107 215 C 

ATOM 1233 C TYR A 551 3.939 16.846 75.739 1.00 22.16 C 

ANISOU 1233 C TYR A 551 2678 2630 3111 -121 -82 136 C 

ATOM 1234 O TYR A 551 3.842 16.000 74.871 1.00 23.19 O 

ANISOU 1234 O TYR A 551 2812 2901 3100 -254 -184 114 O 

ATOM 1235 CB TYR A 551 5.375 18.882 75.293 1.00 19.68 C 

ANISOU 1235 CB TYR A 551 2346 2340 2791 -357 -31 87 C 

ATOM 1236 CG TYR A 551 6.163 18.079 74.261 1.00 19.76 C 

ANISOU 1236 CG TYR A 551 2121 2480 2908 -346 16 116 C 

ATOM 1237 CD1 TYR A 551 5.966 18.245 72.892 1.00 19.83 C 

ANISOU 1237 CD1 TYR A 551 2216 2489 2828 -431 36 -34 C 

ATOM 1238 CD2 TYR A 551 7.059 17.110 74.685 1.00 20.99 C 

ANISOU 1238 CD2 TYR A 551 2399 2410 3165 -388 45 41 C 

ATOM 1239 CE1 TYR A 551 6.647 17.464 71.979 1.00 19.92 C 

P ANISOU 1239 CE1 TYR A 551 2278 2447 2845 -361 -42 -31 C 

if! ATOM 1240 CE2 TYR A 551 7.770 16.333 73.799 1.00 19.81 C 

.SI ANISOU 1240 CE2 TYR A 551 2230 2390 2907 -358 -83 -65 C 

f*£ ATOM 1241 CZ TYR A 551 7.554 16.512 72.449 1.00 20.68 C 

III ANISOU 1241 CZ TYR A 551 2485 2446 2925 -321 -18 79 C 

fp ATOM 1242 OH TYR A 551 8.268 15.769 71.555 1.00 21.83 O 

%\ ANISOU 1242 OH TYR A 551 2357 2564 3373 -132 -156 -63 O 

ATOM 1243 N VAL A 552 4.265 16.533 76.986 1.00 21.72 N 

+= ANISOU 1243 N VAL A 552 2386 2787 3078 -120 -30 180 N 

SJ ATOM 1244 CA VAL A 552 4.385 15.135 77.406 1.00 22.67 C 

ANISOU 1244 CA VAL A 552 2549 2802 3262 -149 -13 132 C 

ATOM 1245 C VAL A 552 3.043 14.392 77.218 1.00 23.51 C 

U ANISOU 1245 C VAL A 552 2671 2936 3325 -258 -5 62 C 

Ml ATOM 1246 O VAL A 552 3.042 13.279 76.656 1.00 24.33 O 

£^ ANISOU 1246 O VAL A 552 2665 3018 3563 -221 -92 32 O 

' . ATOM 1247 CB VAL A 552 4.835 15.022 78.856 1.00 22.88 C 

N ANISOU 1247 CB VAL A 552 2728 2733 3233 -267 -1 -12 C 

O ATOM 1248 CGI VAL A 552 4.887 13.538 79.270 1.00 23.22 C 

ANISOU 1248 CGI VAL A 552 2805 2751 3266 -108 -86 143 C 

P . ATOM 1249 CG2 VAL A 552 6.250 15.553 79.032 1.00 21.76 C 

ANISOU 1249 CG2 VAL A 552 2469 2657 3142 -30 -118 65 C 

ATOM 1250 N MET A 553 1.989 15.073 77.663 1.00 23.62 N 

ANISOU 1250 N MET A 553 2690 2877 3407 -202 -72 195 N 

ATOM 1251 CA MET A 553 0.652 14.423 77.505 1.00 24.68 C 

ANISOU 1251 CA MET A 553 2803 2951 3624 -329 -27 129 C 

ATOM 1252 C MET A 553 0,296 14.201 76.061 1.00 26.17 C 

ANISOU 1252 C MET A 553 3177 3123 3644 -57 -15 61 C 

ATOM 1253 O MET A 553 -0.177 13.097 75.669 1.00 28.15 O 

ANISOU 1253 O MET A 553 3440 3305 3950 -389 -109 53 O 

ATOM 1254 CB MET A 553 -0.353 15.281 78.272 1.00 23.70 C 

ANISOU 1254 CB MET A 553 2706 2827 3473 -467 -1 177 C 

ATOM 1255 CG MET A 553 -0.220 15.213 79.787 1.00 26.39 C 

ANISOU 1255 CG MET A 553 3285 3188 3553 -376 45 88 C 

ATOM 1256 SD MET A 553 -1.312 16.315 80.696 1.00 29.71 S 

ANISOU 1256 SD MET A 553 3253 3855 4178 -231 342 92 S 

ATOM 1257 CE MET A 553 -1.514 17.756 79.686 1.00 31.97 C 

ANISOU 1257 CE MET A 553 3853 4196 4098 -122 147 142 C 

ATOM 1258 N GLU A 554 0.511 15.163 75.164 1.00 25.97 N 

ANISOU 1258 N GLU A 554 2964 3201 3703 -86 4 102 N 
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ATOM 1259 CA GLU A 554 0.181 15.005 73.757 1.00 27.16 C 

ANISOU 1259 CA GLU A 554 3313 3308 3698 -127 -18 1 C 

ATOM 1260 C GLU A 554 1.098 14.065 73.003 1.00 26.08 C 

ANISOU 1260 C GLU A 554 3038 3297 3576 -196 -102 47 C 

ATOM 1261 0 GLU A 554 0.659 13.172 72.252 1.00 26.70 0 

ANISOU 1261 O GLU A 554 3074 3283 3790 -361 -293 12 O 

ATOM 1262 CB GLU A 554 0.117 16.374 73.050 1.00 29.34 C 

ANISOU 1262 CB GLU A 554 3799 3529 3820 -210 -66 159 C 

ATOM 1263 CG GLU A 554 -0.898 17.308 73.689 1.00 32.77 C 

ANISOU 1263 CG GLU A 554 4107 4049 4294 11 27 -15 C 

ATOM 1264 CD GLU A 554 -2.343 16.855 73.550 1.00 35.94 C 

ANISOU 1264 CD GLU A 554 4309 4585 4762 -137 -26 5 C 

ATOM 1265 OE1 GLU A 554 -2.631 16.312 72.470 1.00 37.57 O 

ANISOU 1265 OE1 GLU A 554 4520 4980 4775 -228 -112 -40 O 

ATOM 1266 OE2 GLU A 554 -3.104 17.042 74.519 1.00 37.27 O 

ANISOU 1266 OE2 GLU A 554 4434 4746 4980 -161 93 15 O 

ATOM 1267 N GLU A 555 2.411 14.166 73.183 1.00 24.96 N 

ANISOU 1267 N GLU A 555 2975 3048 3461 -271 -172 78 N 

ATOM 1268 CA GLU A 555 3.378 13.327 72.502 1.00 24.85 C 

ANISOU 1268 CA GLU A 555 2759 3221 3463 -315 -182 -14 C 

ATOM 1269 C GLU A 555 3.218 11.868 72.897 1.00 26.77 C 

ANISOU 1269 C GLU A 555 3176 3310 3686 -84 -143 59 C 

ATOM 1270 O GLU A 555 3.437 10.979 72.081 1.00 29.06 O 

ANISOU 1270 O GLU A 555 3680 3496 3868 -228 -53 -99 O 

ATOM 1271 CB GLU A 555 4.818 13.829 72.834 1.00 25.12 C 

0 ANISOU 1271 CB GLU A 555 2780 3207 3557 -214 -142 -116 C 
,f* ATOM 1272 CG GLU A 555 5.884 13.047 72.095 1.00 24.32 C 
£J ANISOU 1272 CG GLU A 555 2804 3053 3384 -277 -116 -104 C 
W ATOM 1273 CD GLU A 555 5.802 13.180 70.592 1.00 25.45 C 
111 ANISOU 1273 CD GLU A 555 3006 3215 3449 -194 -128 -23 C 
m ATOM 1274 OE1 GLU A 555 5.283 14.191 70.069 1.00 26.52 O 
«; ANISOU 1274 OE1 GLU A 555 3194 3175 3709 -237 -137 -15 O 
^ ATOM 1275 OE2 GLU A 555 6.302 12.257 69.917 1.00 25.01 O 
JjJ ANISOU 1275 OE2 GLU A 555 2964 3115 3424 -181 -95 68 O 
\\ ATOM 1276 N SER A 556 2.807 11.611 74.138 1.00 27.38 N 

ANISOU 1276 N SER A 556 3426 3272 3706 -160 -58 68 N 

1 ATOM 1277 CA SER A 556 2.605 10.236 74.590 1.00 28.32 C 
Q ANISOU 1277 CA SER A 556 3570 3307 3884 -260 -158 76 C 
Ifi ATOM 1278 C SER A 556 1.213 9.710 74.262 1.00 29.09 C 
lX ANISOU 1278 C SER A 556 3485 3512 4057 -240 -38 34 C 

* ATOM 1279 O SER A 556 0.870 8.642 74.794 1.00 30.33 O 

ANISOU 1279 O SER A 556 3797 3389 4339 -335 -138 2 O 

n ATOM 1280 CB SER A 556 2.796 10.137 76.107 1.00 27.22 C 

n ANISOU 1280 CB SER A 556 3373 3170 3800 -195 -28 63 C 

S 5 * ATOM 1281 OG SER A 556 1.787 10.829 76.803 1.00 26.02 O 

ANISOU 1281 OG SER A 556 2773 3122 3989 -399 -131 121 O 

ATOM 1282 N LYS A 557 0.404 10.484 73.579 1.00 30.03 N 

ANISOU 1282 N LYS A 557 3768 3536 4104 -271 -156 16 N 

ATOM 1283 CA LYS A 557 -0.977 10.070 73.286 1.00 32.80 C 

ANISOU 1283 CA LYS A 557 3756 4228 4479 -220 -63 41 C 

ATOM 1284 C LYS A 557 -1.721 9.825 74.577 1.00 33.23 C 

ANISOU 1284 C LYS A 557 3980 4202 4442 -136 -55 37 C 

ATOM 1285 O LYS A 557 -2.440 8.813 74.727 1.00 34.82 O 

ANISOU 1285 O LYS A 557 4210 4271 4749 -324 -139 -29 O 

ATOM 1286 CB LYS A 557 -0.924 8.833 72.392 1.00 34.77 C 

AN I SOU 1286 CB LYS A 557 4195 4368 4648 -148 -102 -73 C 

ATOM 1287 CG LYS A 557 -0.517 9.177 70.965 1.00 38.01 C 

ANISOU 1287 CG LYS A 557 4704 4951 4787 -114 -3 33 C 

ATOM 1288 CD LYS A 557 -0.088 7.923 70.218 1.00 40.41 C 

ANISOU 1288 CD LYS A 557 5065 5095 5196 -3 -16 -90 C 

ATOM 1289 CE LYS A 557 0.416 8.299 68.825 1.00 42.43 C 

ANISOU 1289 CE LYS A 557 5443 5425 5252 -33 16 9 C 

ATOM 1290 NZ LYS A 557 1.635 7.512 68.466 1.00 44.10 N 

ANISOU 1290 NZ LYS A 557 5512 5671 5574 47 47 -22 N 
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ATOM 1291 N HIS A 558 -1.604 10.765 75.502 1.00 32.60 N 

ANISOU 1291 N HIS A 558 3743 4183 4463 -187 -75 16 N 

ATOM 1292 CA HIS A 558 -2.246 10.757 76.800 1.00 33.68 C 

ANISOU 1292 CA HIS A 558 4072 4242 4482 -99 -14 35 C 

ATOM 1293 C HIS A 558 -1.999 9.509 77.618 1.00 33.69 C 

ANISOU 1293 C HIS A 558 4004 4189 4607 -55 5 17 C 

ATOM 1294 O HIS A 558 -2.929 9.008 78.287 1.00 35.32 O 

ANISOU 1294 O HIS A 558 4271 4416 4731 -80 201 169 O 

ATOM 1295 CB HIS A 558 -3.767 11.022 76.623 1.00 35.06 C 

ANISOU 1295 CB HIS A 558 4096 4524 4702 -69 -45 36 C 

ATOM 1296 CG HIS A 558 -3.906 12.416 76.059 1.00 36.32 C 

ANISOU 1296 CG HIS A 558 4365 4626 4807 30 -17 71 C 

ATOM 1297 NDl HIS A 558 -4.002 12.646 74.705 1.00 37.84 N 

ANISOU 1297 NDl HIS A 558 4643 4878 4855 33 -26 46 N 

ATOM 1298 CD2 HIS A 558 -3.882 13.615 76.676 1.00 36.40 C 

ANISOU 1298 CD2 HIS A 558 4386 4659 4786 -21 -48 6 C 

ATOM 1299 CE1 HIS A 558 -4.071 13.957 74.501 1.00 37.74 C 

ANISOU 1299 CE1 HIS A 558 4664 4836 4839 -51 26 9 C 

ATOM 1300 NE2 HIS A 558 -4.001 14.557 75.675 1.00 37.69 N 

ANISOU 1300 NE2 HIS A 558 4664 4849 4806 -83 17 46 N 

ATOM 1301 N THR A 559 -0.775 8.998 77.602 1.00 31.31 N 

ANISOU 1301 N THR A 559 3768 3728 4399 -297 -91 22 N 

ATOM 1302 CA THR A 559 -0.446 7.850 78.422 1.00 31.64 C 

ANISOU 1302 CA THR A 559 3894 3908 4220 -309 -12 118 C 

ATOM 1303 C THR A 559 0.316 8.269 79.664 1.00 31.07 C 

3 ANISOU 1303 C THR A 559 3985 3721 4100 -247 39 160 C 

7j ATOM 1304 O THR A 559 0.378 7.551 80.671 1.00 31.04 O 

f* ANISOU 1304 O THR A 559 4015 3561 4218 -350 46 222 O 

ATOM 1305 CB THR A 559 0.341 6.761 77.702 1.00 32.73 C 

m ANISOU 1305 CB THR A 559 4065 4057 4314 -146 -24 73 C 

kl ATOM 1306 OG1 THR A 559 1.490 7.238 76.996 1.00 32.65 O 

zl ANISOU 1306 OG1 THR A 559 4065 3911 4432 -285 -45 66 O 

FIJ ATOM 1307 CG2 THR A 559 -0.505 6.013 76.659 1.00 32.81 C 

J! ANISOU 1307 CG2 THR A 559 4071 4159 4235 -215 -79 135 C 

Ci ATOM 1308 N ILE A 560 1.098 9.370 79.523 1.00 28.80 N 

~' 4 ANISOU 1308 N ILE A 560 3655 3444 3845 -136 -90 99 N 

* ATOM 1309 CA ILE A 560 1.937 9.804 80.635 1.00 28.76 C 

H ANISOU 1309 CA ILE A 560 3735 3393 3797 -4 -42 47 C 

slj ATOM 1310 C ILE A 560 1.943 11.325 80.725 1.00 27.06 C 

ANISOU 1310 C ILE A 560 3319 3330 3632 -181 -12 97 C 

H ATOM 1311 O ILE A 560 1.502 12.002 79.780 1.00 25.47 O 

SJ ANISOU 1311 O ILE A 560 3138 3136 3404 -365 -109 -50 O 

r-h ATOM 1312 CB ILE A 560 3.402 9.351 80.550 1.00 30.47 C 

H ANISOU 1312 CB ILE A 560 3790 3772 4013 16 16 50 C 

ATOM 1313 CGI ILE A 560 4.066 9.737 79.235 1.00 30.69 C 

ANISOU 1313 CGI ILE A 560 3822 3865 3973 21 15 40 C 

ATOM 1314 CG2 ILE A 560 3.567 7.868 80.869 1.00 30.96 C 

ANISOU 1314 CG2 ILE A 560 3886 3731 4145 -19 -16 -20 C 

ATOM 1315 CD1 ILE A 560 5.570 9.465 79.241 1.00 30.76 C 

ANISOU 1315 CD1 ILE A 560 3679 3999 4009 -81 -71 48 C 

ATOM 1316 N GLY A 561 2.295 11.821 81.907 1.00 26.90 N 

ANISOU 1316 N GLY A 561 3362 3190 3669 -126 -48 83 N 

ATOM 1317 CA GLY A 561 2.398 13.268 82.091 1.00 26.89 C 

ANISOU 1317 CA GLY A 561 3295 3170 3751 -242 -57 141 C 

ATOM 1318 C GLY A 561 1.403 13.967 82.959 1.00 29.30 C 

ANISOU 1318 C GLY A 561 3630 3540 3963 -83 22 29 C 

ATOM 1319 O GLY A 561 1.754 14.916 83.667 1.00 28.44 O 

ANISOU 1319 O GLY A 561 3166 3646 3994 -347 60 87 O 

ATOM 1320 N GLU A 562 0.123 13.551 83.033 1.00 31.92 N 

ANISOU 1320 N GLU A 562 3812 3912 4403 -289 12 88 N 

ATOM 1321 CA GLU A 562 -0.894 14.265 83.803 1.00 33.43 C 

ANISOU 1321 CA GLU A 562 4179 4112 4413 -85 83 39 C 

ATOM 1322 C GLU A 562 -0.685 14.344 85.300 1.00 31.31 C 

ANISOU 1322 C GLU A 562 3697 3829 4369 -172 73 120 C 
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ATOM 1323 0 GLU A 562 -1.151 15.281 85.988 1.00 30.72 0 

ANISOU 1323 0 GLU A 562 3742 3700 4229 -265 131 229 0 

ATOM 1324 CB GLU A 562 -2.281 13.699 83.422 1.00 38.48 C 

ANISOU 1324 CB GLU A 562 4420 5013 5189 -158 -73 -61 C 

ATOM 1325 CG GLU A 562 -2.677 12.442 84.171 1.00 43.17 C 

ANISOU 1325 CG GLU A 562 5524 5245 5634 -35 53 163 C 

ATOM 1326 CD GLU A 562 -4.082 11.974 83.808 1.00 46.65 C 

ANISOU 1326 CD GLU A 562 5690 5941 6094 -89 -65 13 C 

ATOM 1327 OE1 GLU A 562 -4.977 12.830 83.576 1.00 47.41 O 

ANISOU 1327 OE1 GLU A 562 5810 5960 6244 15 -30 -13 O 

ATOM 1328 OE2 GLU A 562 -4.284 10.735 83.751 1.00 49.19 O 

ANISOU 1328 OE2 GLU A 562 6253 6001 6437 -48 -10 42 O 

ATOM 1329 N ASP A 563 0.084 13.431 85.890 1.00 28.74 N 

ANISOU 1329 N ASP A 563 3271 3553 4096 -421 112 69 N 

ATOM 1330 CA ASP A 563 0.382 13.368 87.291 1.00 29.52 C 

ANISOU 1330 CA ASP A 563 3462 3712 4041 -212 185 74 C 

ATOM 1331 C ASP A 563 1.820 13.792 87.605 1.00 28.85 C 

ANISOU 1331 C ASP A 563 3391 3756 3815 -92 68 66 C 

ATOM 1332 O ASP A 563 2.248 13.622 88.744 1.00 28.02 O 

ANISOU 1332 O ASP A 563 3147 3722 3775 -141 263 121 O 

ATOM 1333 CB ASP A 563 0.161 11.936 87.827 1.00 32.05 C 

ANISOU 1333 CB ASP A 563 4080 3863 4233 -219 158 177 C 

ATOM 1334 CG ASP A 563 1.014 10.881 87.188 1.00 33.86 C 

ANISOU 1334 CG ASP A 563 4361 4082 4421 -81 192 79 C 

ATOM 1335 OD1 ASP A 563 1.761 11.179 86.221 1.00 33.79 O 

0 ANISOU 1335 OD1 ASP A 563 4295 3981 4562 -259 185 232 O 
drt ATOM 1336 OD2 ASP A 563 0.998 9.670 87.573 1.00 35.37 O 
^ ANISOU 1336 OD2 ASP A 563 4698 4219 4524 -95 223 297 O 

ATOM 1337 N PHE A 564 2.504 14.388 86.608 1.00 28.32 N 

Hi ANISOU 1337 N PHE A 564 3435 3509 3816 -134 30 16 N 

01 ATOM 1338 CA PHE A 564 3.869 14.845 86.925 1.00 26.63 C 
= s" ANISOU 1338 CA PHE A 564 3161 3432 3526 -54 242 103 C 

ATOM 1339 C PHE A 564 3.870 16.034 87.880 1.00 27.46 C 

4" ANISOU 1339 C PHE A 564 3417 3352 3666 -121 181 107 C 

SI ATOM 1340 O PHE A 564 2.951 16.876 87.830 1.00 28.83 O 

^ * ANISOU 1340 O PHE A 564 3359 3570 4023 21 67 168 O 

1, ATOM 1341 CB PHE A 564 4.619 15.226 85.635 1.00 25.30 C 

W ANISOU 1341 CB PHE A 564 3155 3120 3337 -60 166 16 C 

||| ATOM 1342 CG PHE A 564 5.184 14.097 84.823 1.00 23.86 C 

ii ANISOU 1342 CG PHE A 564 2835 2962 3269 -76 81 31 C 

H ATOM 1343 CD1 PHE A 564 . 4.664 12.810 84.786 1.00 25.17 C 

N ANISOU 1343 CD1 PHE A 564 3212 2965 3386 -98 89 83 C 

rj ATOM 1344 CD2 PHE A 564 6.279 14.362 83.997 1.00 24.60 C 

£1 ANISOU 1344 CD2 PHE A 564 3173 3039 3135 -100 89 206 C 

' ** ATOM 1345 CE1 PHE A 564 5.247 11.848 83.996 1.00 24.83 C 

ANISOU 1345 CE1 PHE A 564 3137 3070 3228 -218 54 9 C 

ATOM 1346 CE2 PHE A 564 6.862 13.391 83.236 1.00 25.49 C 

ANISOU 1346 CE2 PHE A 564 3178 3121 3387 -210 130 21 C 

ATOM 1347 CZ PHE A 564 6.346 12.084 83.205 1.00 25.85 C 

ANISOU 1347 CZ PHE A 564 3282 3142 3396 -227 72 15 C 

ATOM 1348 N ASP A 565 4.894 16.155 88.717 1.00 27.01 N 

ANISOU 1348 N ASP A 565 3369 3467 3428 -112 231 327 N 

ATOM 1349 CA ASP A 565 5.046 17.272 89.628 1.00 30.49 C 

ANISOU 1349 CA ASP A 565 3846 3764 3974 -168 262 14 C 

ATOM 1350 C ASP A 565 5.778 18.401 88.872 1.00 30.83 C 

ANISOU 1350 C ASP A 565 3881 3829 4005 -90 262 104 C 

ATOM 1351 O ASP A 565 6.840 18.072 88.319 1.00 31.71 O 

ANISOU 1351 O ASP A 565 4021 3723 4302 -125 478 1 O 

ATOM 1352 CB ASP A 565 5.877 16.880 90.853 1.00 34.44 C 

ANISOU 1352 CB ASP A 565 4294 4408 4386 -13 15 135 C 

ATOM 1353 CG ASP A 565 5.699 17.779 92.064 1.00 37.62 C 

ANISOU 1353 CG ASP A 565 4837 4805 4652 -46 71 -62 C 

ATOM 1354 OD1 ASP A 565 4.717 18.577 92.121 1.00 40.24 O 

ANISOU 1354 OD1 ASP A 565 5238 4996 5054 143 156 -92 O 
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ATOM 1355 OD2 ASP A 565 6.516 17.728 93,003 1.00 38.63 O 

ANISOU 1355 OD2 ASP A 565 4933 4931 4814 -107 8-1 0 

ATOM 1356 N LEU A 566 5.202 19.595 88.854 1.00 29.98 N 

ANISOU 1356 N LEU A 566 3876 3820 3693 -106 276 60 N 

ATOM 1357 CA LEU A 566 5.817 20.740 88.179 1.00 30.69 C 

ANISOU 1357 CA LEU A 566 3846 3857 3959 -155 273 54 C 

ATOM 1358 C LEU A 566 6.625 21.511 89.207 1.00 31.32 C 

ANISOU 1358 C LEU A 566 3889 3974 4039 -175 302 -41 C 

ATOM 1359 O LEU A 566 6.082 21.906 90.256 1.00 31.88 O 

ANISOU 1359 O LEU A 566 4061 3950 4101 -311 440 -145 O 

ATOM 1360 CB LEU A 566 4.771 21.619 87.510 1.00 32.50 C 

ANISOU 1360 CB LEU A 566 4165 4051 4133 -30 140 86 C 

ATOM 1361 CG LEU A 566 3.753 20.999 86.573 1.00 32.34 C 

ANISOU 1361 CG LEU A 566 4113 3941 4233 -74 147 37 C 

ATOM 1362 CD1 LEU A 566 2.804 21.979 85.892 1.00 34.10 C 

ANISOU 1362 CD1 LEU A 566 4423 4203 4330 2 96 117 C 

ATOM 1363 CD2 LEU A 566 4.410 20.150 85.495 1.00 33.68 C 

ANISOU 1363 CD2 LEU A 566 4321 4277 4201 -5 160 -15 C 

ATOM 1364 N ILE A 567 7.932 21.719 88.987 1.00 28.81 N 

ANISOU 1364 N ILE A 567 3728 3373 3848 -51 290 95 N 

ATOM 1365 CA ILE A 567 8.763 22.332 90.009 1.00 28.23 C 

ANISOU 1365 CA ILE A 567 3572 3500 3657 -99 358 153 C 

ATOM 1366 C ILE A 567 9.680 23.441 89.482 1.00 26.45 C 

ANISOU 1366 C ILE A 567 3392 3251 3409 -13 152 101 C 

ATOM 1367 O ILE A 567 10.130 23.274 88.350 1.00 25.65 O 

Q ANISOU 1367 O ILE A 567 3337 2913 3496 49 247 121 O 

ifg ATOM 1368 CB ILE A 567 9.629 21.266 90.701 1.00 30.86 C 

X ANISOU 1368 CB ILE A 567 3950 3828 3949 -20 158 286 C 

*?* ATOM 1369 CGI ILE A 567 10.558 21.854 91.768 1.00 32.95 C 

01 ANISOU 1369 CGI ILE A 567 4180 4044 4296 -87 -11 174 C 

01 ATOM 1370 CG2 ILE A 567 10.493 20.436 89.775 1.00 33.22 C 

ft S ANISOU 1370 CG2 ILE A 567 4314 4161 4146 40 183 98 C 

s ^ ATOM 1371 CD1 ILE A 567 9.803 22.334 92.986 1.00 35.67 C 

4* ANISOU 1371 CD1 ILE A 567 4647 4518 4388 -50 78 43 C 

\| ATOM 1372 N HIS A 568 9.807 24.516 90.224 1.00 24.84 N 

ANISOU 1372 N HIS A 568 3102 3302 3033 -61 145 208 N 

1, ATOM 1373 CA HIS A 568 10.759 25.581 89.879 1.00 25.70 C 

Q ANISOU 1373 CA HIS A 568 3155 3457 3152 -146 39 134 C 

U1 ATOM 1374 C HIS A 568 12.034 25.130 90.629 1.00 26.73 C 

ANISOU 1374 C HIS A 568 3310 3625 3222 57 66 117 C 

:! ATOM 1375 O HIS A 568 12.050 25.213 91.863 1.00 27.81 O 

N ANISOU 1375 O HIS A 568 3427 3972 3166 -24 -6 196 O 

H| ATOM 1376 CB HIS A 568 10.274 26.945 90.324 1.00 27.99 C 

C\ ANISOU 1376 CB HIS A 568 3593 3575 3466 -43 52 27 C 

ATOM 1377 CG HIS A 568 11.100 28.126 89.865 1.00 31.14 C 

ANISOU 1377 CG HIS A 568 4036 3871 3925 -196 95 52 C 

ATOM 1378 ND1 HIS A 568 10.560 29.082 89.037 1.00 31.70 N 

ANISOU 1378 ND1 HIS A 568 4013 3873 4159 46 146 11 N 

ATOM 1379 CD2 HIS A 568 12.379 28.514 90.071 1.00 33.15 C 

ANISOU 1379 CD2 HIS A 568 4084 4198 4315 -162 39 39 C 

ATOM 1380 CE1 HIS A 568 11.485 30.007 88.750 1.00 32.74 C 

ANISOU 1380 CE1 HIS A 568 4265 3912 4262 -44 10 -53 C 

ATOM 1381 NE2 HIS A 568 12.590 29.695 89.372 1.00 32.32 N 

ANISOU 1381 NE2 HIS A 568 4113 3897 4272 -26 19 -126 N 

ATOM 1382 N LEU A 569 12.987 24.543 89.901 1.00 25.62 N 

ANISOU 1382 N LEU A 569 3271 3325 3140 -43 239 315 N 

ATOM 1383 CA LEU A 569 14.184 23.990 90.507 1.00 27.08 C 

ANISOU 1383 CA LEU A 569 3364 3567 3360 -6 133 217 C 

ATOM 1384 C LEU A 569 15.212 25.016 90.919 1.00 28.36 C 

ANISOU 1384 C LEU A 569 3513 3742 3521 -60 36 187 C 

ATOM 1385 O LEU A 569 15.493 25,954 90.149 1.00 29.04 O 

ANISOU 1385 O LEU A 569 3578 3840 3615 -195 68 312 O 

ATOM 1386 CB LEU A 569 14.881 23.040 89.496 1.00 28.77 C 

ANISOU 1386 CB LEU A 569 3662 3640 3628 70 193 103 C 
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ATOM 1387 CG LEU A 569 14.059 21.884 88.940 1.00 31.08 C 

ANISOU 1387 CG LEU A 569 3961 3909 3940 -25 110 -85 C 

ATOM 1388 CD1 LEU A 569 14.730 21.287 87,700 1.00 30.23 C 

ANISOU 1388 CD1 LEU A 569 3875 3692 3918 -64 86 -16 C 

ATOM 1389 CD2 LEU A 569 13.837 20.810 90.005 1.00 32.36 C 

ANISOU 1389 CD2 LEU A 569 4118 3988 4190 120 86 72 C 

ATOM 1390 N ASP A 570 15.805 24.873 92.082 1.00 28.26 N 

ANISOU 1390 N ASP A 570 3498 3877 3362 1 239 274 N 

ATOM 1391 CA ASP A 570 16.844 25.760 92.553 1.00 29.70 C 

ANISOU 1391 CA ASP A 570 3647 3905 3733 -38 194 88 C 

ATOM 1392 C ASP A 570 18.160 25.495 91.825 1.00 26.46 C 

ANISOU 1392 C ASP A 570 3306 3347 3401 -55 -19 182 C 

ATOM 1393 O ASP A 570 18.451 24.337 91.609 1.00 25.67 O 

ANISOU 1393 O ASP A 570 3032 3198 3522 -383 422 424 O 

ATOM 1394 CB ASP A 570 17.215 25.391 94.013 1.00 34.38 C 

ANISOU 1394 CB ASP A 570 4400 4612 4052 6 -90 179 C 

ATOM 1395 CG ASP A 570 16.044 25.734 94.911 1.00 37.90 C 

ANISOU 1395 CG ASP A 570 4636 5086 4680 44 129 -33 C 

ATOM 1396 OD1 ASP A 570 15.544 26.859 94.700 1.00 40.78 O 

ANISOU 1396 OD1 ASP A 570 5161 5227 5107 130 22 32 O 

ATOM 1397 OD2 ASP A 570 15.736 24.842 95.727 1.00 39.89 O 

ANISOU 1397 OD2 ASP A 570 4875 5415 4867 24 160 151 O 

ATOM 1398 N CYS A 571 18.994 26.481 91.568 1.00 25.36 N 

ANISOU 1398 N CYS A 571 3286 3138 3213 61 32 321 N 

& ATOM 1399 CA CYS A 571 20.318 26.223 91.010 1.00 23.70 C 

^ ANISOU 1399 CA CYS A 571 3352 2818 2836 49 91 50 C 

ATOM 1400 C CYS A 571 21.241 27.380 91.342 1.00 23.46 C 

35 ANISOU 1400 C CYS A 571 3208 2812 2892 28 171 138 C 

sV* ATOM 1401 O CYS A 571 20.748 28.429 91.748 1.00 23.13 O 

Zl ANISOU 1401 O CYS A 571 3183 2724 2883 -24 310 220 O 

Us ATOM 1402 CB CYS A 571 20.278 25.968 89.475 1.00 22.21 C 

ftj ANISOU 1402 CB CYS A 571 3231 2509 2699 171 148 210 C 

~j* ATOM 1403 SG CYS A 571 20.045 27,501 88.538 1.00 21.88 S 

7! ANISOU 1403 SG CYS A 571 3662 2459 2192 96 372 113 S 

N ATOM 1404 N PRO A 572 22.548 27.222 91.217 1.00 24.54 N 

a ANISOU 1404 N PRO A 572 3200 2947 3176 7 178 40 N 

<p* ATOM 1405 CA PRO A 572 23.518 28.285 91.440 1.00 24.11 C 

ANISOU 1405 CA PRO A 572 3218 2836 3106 56 174 81 C 

Wi ATOM 1406 C PRO A 572 23.171 29.463 90.534 1.00 23.17 C 

M= ANISOU 1406 C PRO A 572 3137 2779 2886 -51 148 16 C 

\A ATOM 1407 O PRO A 572 • 22.841 29.259 89.382 1.00 22.63 O 

,Zt ANISOU 1407 O PRO A 572 3190 2682 2727 -133 307 149 O 

U ATOM 1408 CB PRO A 572 24.876 27.683 91.122 1.00 25.09 C 

M> ANISOU 1408 CB PRO A 572 3137 3059 3335 79 110 88 C 

ATOM 1409 CG PRO A 57 2 24.643 26.190 91.232 1.00 26.29 C 

ANISOU 1409 CG PRO A 572 3405 3154 3429 -151 50 17 C 

ATOM 1410 CD PRO A 572 23.218 25.967 90.760 1.00 24.99 C 

ANISOU 1410 CD PRO A 572 3202 2978 3315 44 151 87 C 

ATOM 1411 N GLN A 573 23.191 30.663 91.113 1.00 21.01 N 

ANISOU 1411 N GLN A 573 3038 2644 2303 -81 295 134 N 

ATOM 1412 CA GLN A 573 22.731 31.830 90.337 1.00 21.31 C 

ANISOU 1412 CA GLN A 573 2952 2581 2562 25 191 48 C 

ATOM 1413 C GLN A 573 23.958 32.538 89.749 1.00 21.12 C 

ANISOU 1413 C GLN A 573 2861 2722 2442 -37 166 16 C 

ATOM 1414 O GLN A 57 3 25.010 32.581 90.392 1.00 22.44 O 

ANISOU 1414 O GLN A 573 3115 2770 2641 -192 111 93 O 

ATOM 1415 CB GLN A 573 22.020 32.789 91.312 1.00 21.04 C 

ANISOU 1415 CB GLN A 573 2833 2769 2391 -95 130 -104 C 

ATOM 1416 CG GLN A 573 20.721 32.169 91.876 1.00 21.77 C 

ANISOU 1416 CG GLN A 573 2805 2889 2578 26 239 -49 C 

ATOM 1417 CD GLN A 573 19.661 31.965 90.827 1.00 23.08 C 

ANISOU 1417 CD GLN A 573 3171 2986 2614 -15 180 -41 C 

ATOM 1418 OE1 GLN A 573 19.046 32.951 90.356 1.00 23.03 O 

ANISOU 1418 OE1 GLN A 573 3086 3003 2660 125 314 -10 O 
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ATOM 1419 NE2 GLN A 573 19.416 30.727 90.379 1.00 22.52 N 

ANISOU 1419 NE2 GLN A 573 3255 2844 2459 55 306 49 N 

ATOM 1420 N GLN A 574 23.812 33.038 88.526 1,00 20.37 N 

ANISOU 1420 N GLN A 574 2923 2289 2526 -162 125 126 N 

ATOM 1421 CA GLN A 574 24.967 33.723 87.887 1.00 21.00 C 

ANISOU 1421 CA GLN A 574 2788 2536 2655 -225 48 -26 C 

ATOM 1422 C GLN A 574 25.221 35.048 88.588 1.00 21.03 C 

ANISOU 1422 C GLN A 574 2803 2462 2728 -96 115 -40 C 

ATOM 1423 O GLN A 574 24.270 35.792 88.897 1.00 20.78 O 

ANISOU 1423 O GLN A 574 2756 2377 2764 -203 51 -265 O 

ATOM 1424 CB GLN A 574 24.584 33.888 86.406 1.00 21.23 C 

ANISOU 1424 CB GLN A 574 2863 2623 2581 -24 279 155 C 

ATOM 1425 CG GLN A 574 23.446 34.893 86.183 1.00 22.92 C 

ANISOU 1425 CG GLN A 574 3015 2804 2888 -35 38 62 C 

ATOM 1426 CD GLN A 574 22.949 34.783 84.758 1.00 23.64 C 

ANISOU 1426 CD GLN A 574 3062 3021 2900 -95 34 91 C 

ATOM 1427 OE1 GLN A 574 22.371 33.774 84.366 1.00 23.51 O 

ANISOU 1427 OE1 GLN A 574 3291 2898 2742 -59 107 119 O 

ATOM 1428 NE2 GLN A 574 23.236 35.797 83.948 1.00 24.35 N 

ANISOU 1428 NE2 GLN A 574 3349 2886 3017 -177 -65 102 N 

ATOM 1429 N PRO A 575 26.489 35.441 88.705 1.00 20.01 N 

ANISOU 1429 N PRO A 575 2714 2225 2664 -105 42 -50 N 

ATOM 1430 CA PRO A 575 26.856 36.702 89.356 1.00 21.59 C 

ANISOU 1430 CA PRO A 575 3074 2403 2727 -130 -50 -53 C 

ATOM 1431 C PRO A 575 27.025 37.844 88.371 1.00 22.37 C 

^ ANISOU 1431 C PRO A 575 3188 2633 2678 -219 69 11 C 

^ ATOM 1432 O PRO A 575 27.517 38.919 88.750 1.00 25.00 O 

QEj ANISOU 1432 O PRO A 575 3881 2758 2858 -423 101 -59 O 

ATOM 1433 CB PRO A 575 28.211 36.325 89.960 1.00 21.25 C 

ANISOU 1433 CB PRO A 575 2817 2454 2802 -119 79 -128 C 

3- ATOM 1434 CG PRO A 575 28.847 35.492 88.873 1.00 20.60 C 

fU ANISOU 1434 CG PRO A 575 2747 2276 2804 -76 88 -36 C 

p ATOM 1435 CD PRO A 575 27.686 34.608 88.385 1.00 21.45 C 

: V " ANISOU 1435 CD PRO A 575 2591 2479 3080 -64 -82 69 C 

^ ATOM 1436 N ASN A 576 26.620 37.623 87.130 1.00 21.02 N 

s ANISOU 1436 N ASN A 576 3061 2327 2598 -112 54 85 N 

ATOM 1437 CA ASN A 576 26.821 38.599 86.049 1.00 19.66 C 

^ ANISOU 1437 CA ASN A 576 2870 2133 2467 -138 -7 -31 C 

ATOM 1438 C ASN A 576 25.662 38.544 85.081 1.00 19.65 C 

M> ANISOU 1438 C ASN A 576 2863 2157 2445 106 84 -20 C 

ATOM 1439 O ASN A 576 24.758 37.713 85.260 1.00 20.86 O 

«2 ANISOU 1439 O ASN A 576 2867 2310 2750 -104 72 -9 O 

U ATOM 1440 CB ASN A 576 28.154 38.398 85.322 1.00 19.47 C 

M ANISOU 1440 CB ASN A 576 2697 2189 2512 -95 -112 -51 C 

ATOM 1441 CG ASN A 576 28.245 36.983 84.730 1.00 20.85 C 

ANISOU 1441 CG ASN A 576 2773 2331 2817 -49 -169 -107 C 

ATOM 1442 OD1 ASN A 576 27.374 36.665 83.914 1.00 19.46 O 

ANISOU 1442 OD1 ASN A 576 3092 1840 2461 -30 -274 121 O 

ATOM 1443 ND2 ASN A 576 29.208 36.159 85.131 1.00 21.52 N 

ANISOU 1443 ND2 ASN A 576 2989 2195 2993 -122 -90 96 N 

ATOM 1444 N GLY A 577 25.706 39.455 84.077 1.00 18.58 N 

ANISOU 1444 N GLY A 577 2819 1935 2307 165 27 -114 N 

ATOM 1445 CA GLY A 577 24.568 39.415 83.156 1.00 19.56 C 

ANISOU 1445 CA GLY A 577 2721 2208 2502 302 -5 -106 C 

ATOM 1446 C GLY A 577 24.797 38.665 81.855 1.00 19.64 C 

ANISOU 1446 C GLY A 577 2836 2245 2383 48 138 15 C 

ATOM 1447 O GLY A 577 23,850 38.716 81.066 1.00 22.23 O 

ANISOU 1447 O GLY A 577 3087 2518 2841 255 -44 27 O 

ATOM 1448 N TYR A 578 25.878 37.922 81.682 1.00 16.92 N 

ANISOU 1448 N TYR A 578 2714 1577 2137 -76 165 21 N 

ATOM 1449 CA TYR A 578 26.145 37.336 80.362 1.00 16.24 C 

ANISOU 1449 CA TYR A 578 2585 1616 1969 -66 24 85 C 

ATOM 1450 C TYR A 578 26.284 35.807 80.384 1.00 16.73 C 

ANISOU 1450 C TYR A 578 2584 1605 2170 22 28 -38 C 
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ATOM 1451 0 TYR A 578 26.277 35.191 79.311 1.00 17.44 0 

ANISOQ 1451 0 TYR A 578 2695 1778 2152 -307 -43 -34 O 

ATOM 1452 CB TYR A 578 27.464 37.881 79.819 1.00 18.05 C 

ANISOU 1452 CB TYR A 578 2417 1988 2454 -51 -9 200 C 

ATOM 1453 CG TYR A 578 28.616 37.965 80.797 1.00 17.53 C 

ANISOU 1453 CG TYR A 578 2369 2120 2173 -157 100 80 C 

ATOM 1454 CD1 TYR A 578 29.377 36.836 81.036 1.00 17.77 C 

ANISOU 1454 CD1 TYR A 578 2327 2085 2339 -193 92 220 C 

ATOM 1455 CD2 TYR A 578 28.955 39.125 81.509 1.00 17.90 C 

ANISOU 1455 CD2 TYR A 578 2503 2039 2258 -265 23 192 C 

ATOM 1456 CE1 TYR A 578 30.460 36.820 81.913 1.00 18.18 C 

ANISOU 1456 CE1 TYR A 578 2563 2145 2199 -269 -51 201 C 

ATOM 1457 CE2 TYR A 578 30.000 39.140 82.409 1.00 18.63 C 

ANISOU 1457 CE2 TYR A 578 2506 2192 2381 -171 -48 211 C 

ATOM 1458 CZ TYR A 578 30.767 38.002 82.588 1.00 19.75 C 

ANISOU 1458 CZ TYR A 578 2728 2199 2576 -85 -60 26 C 

ATOM 1459 OH TYR A 578 31.839 38.006 83.459 1.00 20.94 O 

ANISOU 1459 OH TYR A 578 3103 2377 2475 -225 -188 383 O 

ATOM 1460 N ASP A 579 26.364 35.218 81,553 1.00 16.24 N 

ANISOU 1460 N ASP A 579 2383 1562 2224 -32 -128 -93 N 

ATOM 1461 CA ASP A 579 26.597 33.756 81.601 1.00 15.40 C 

ANISOU 1461 CA ASP A 579 2375 1479 1997 -149 -153 31 C 

ATOM 1462 C ASP A 579 25.363 32.895 81.521 1.00 15.59 C 

ANISOU 1462 C ASP A 579 2026 1757 2142 -1 35 9 C 

~s ATOM 1463 O ASP A 579 25.522 31.662 81.674 1.00 16.28 O 

'% ANISOU 1463 O ASP A 579 2433 1646 2108 -37 199 9 O 

ATOM 1464 CB ASP A 579 27.393 33.402 82.866 1.00 16.76 C 

gj ANISOU 1464 CB ASP A 579 2320 1856 2192 -222 -213 171 C 

m ATOM 1465 CG ASP A 579 28.892 33.467 82.657 1.00 18.53 C 

%l ANISOU 1465 CG ASP A 579 2559 2127 2355 55 -69 56 C 

%M ATOM 1466 OD1 ASP A 579 29.395 33.366 81.509 1.00 18.83 O 

flj ANISOU 1466 OD1 ASP A 579 2163 2493 2500 -31 -71 125 O 

j* ATOM 1467 OD2 ASP A 579 29.610 33.626 83.673 1.00 20.44 O 

3! ANISOU 1467 OD2 ASP A 579 2765 2218 2782 59 -321 171 O 

N ATOM 1468 N CYS A 580 24.182 33.463 81.341 1.00 14.58 N 

s ANISOU 1468 N CYS A 580 1786 1749 2004 -121 -45 4 N 

ATOM 1469 CA CYS A 580 22.970 32.593 81.394 1.00 14.75 C 

ANISOU 1469 CA CYS A 580 1887 1776 1941 -146 73 49 C 

^ ATOM 1470 C CYS A 580 23.109 31.389 80.475 1.00 15.30 C 

f=* ANISOU 1470 C CYS A 580 2005 1882 1927 -119 -34 50 C 

ATOM 1471 O CYS A 580 22.645 30.293 80.890 1.00 15.92 O 

ANISOU 1471 O CYS A 580 2281 1646 2122 -91 156 -60 O 

U ATOM 1472 CB CYS A 580 21.662 33.321 81.112 1.00 16.92 C 

H* ANISOU 1472 CB CYS A 580 2430 2088 1910 101 33 49 C 

ATOM 1473 SG CYS A 580 21.746 34.182 79.490 1.00 15.01 S 

ANISOU 1473 SG CYS A 580 2141 1455 2108 50 101 14 S 

ATOM 1474 N GLY A 581 23.559 31.538 79.246 1.00 15.26 N 

ANISOU 1474 N GLY A 581 2113 1666 2021 44 147 -83 N 

ATOM 1475 CA GLY A 581 23.611 30.365 78.313 1.00 15.15 C 

ANISOU 1475 CA GLY A 581 2305 1638 1815 -29 -224 -2 C 

ATOM 1476 C GLY A 581 24.626 29.341 78.879 1.00 12.80 C 

ANISOU 1476 C GLY A 581 1518 1669 1677 -133 135 98 C 

ATOM 1477 O GLY A 581 24.369 28.115 78.689 1.00 16.14 O 

ANISOU 1477 O GLY A 581 2411 1526 2194 74 128 65 O 

ATOM 1478 N ILE A 582 25.694 29.752 79.489 1.00 13.89 N 

ANISOU 1478 N ILE A 582 2309 1315 1655 145 100 105 N 

ATOM 1479 CA ILE A 582 26.664 28.816 80.097 1.00 11.67 C 

ANISOU 1479 CA ILE A 582 942 1769 1723 2 25 110 C 

ATOM 1480 C ILE A 582 25.980 28.097 81.274 1.00 14.76 C 

ANISOU 1480 C ILE A 582 2254 1734 1621 235 -19 111 C 

ATOM 1481 O ILE A 582 26.205 26.886 81.440 1.00 16.85 O 

ANISOU 1481 O ILE A 582 2551 1737 2117 -13 -5 89 O 

ATOM 1482 CB ILE A 582 27.966 29.614 80.495 1.00 14.03 C 

ANISOU 1482 CB ILE A 582 2097 1580 1655 -29 -270 186 C 
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ATOM 1483 CGI ILE A 582 28.659 29.974 79.220 1.00 15.12 C 

ANISOU 1483 CGI ILE A 582 1710 1814 2222 98 161 -32 C 

ATOM 1484 CG2 ILE A 582 28.785 28.740 81.423 1.00 14.20 C 

ANISOU 1484 CG2 ILE A 582 1517 1940 1938 -94 -207 270 C 

ATOM 1485 CD1 ILE A 582 29.291 28.852 78.392 1.00 17.25 C 

ANISOU 1485 CD1 ILE A 582 2564 2012 1976 -73 221 -182 C 

ATOM 1486 N TYR A 583 25.198 28.807 82.100 1.00 14.54 N 

ANISOU 1486 N TYR A 583 1670 1839 2016 -52 192 33 N 

ATOM 1487 CA TYR A 583 24.513 28.141 83.214 1.00 15.88 C 

ANISOU 1487 CA TYR A 583 2282 1866 1886 99 44 89 C 

ATOM 1488 C TYR A 583 23.505 27.151 82.647 1.00 14.88 C 

ANISOU 1488 C TYR A 583 1675 1970 2011 -51 -43 77 C 

ATOM 1489 O TYR A 583 23.331 26.047 83.234 1.00 16.28 O 

ANISOU 1489 O TYR A 583 2221 1849 2117 32 106 9 O 

ATOM 1490 CB TYR A 583 23.892 29.165 84.154 1.00 15.73 C 

ANISOU 1490 CB TYR A 583 2254 1807 1916 0 -54 -20 C 

ATOM 1491 CG TYR A 583 24.884 29.693 85.160 1.00 16.77 C 

ANISOU 1491 CG TYR A 583 1988 2161 2224 -35 -82 32 C 

ATOM 1492 CD1 TYR A 583 25.956 30.477 84.772 1.00 16.09 C 

ANISOU 1492 CD1 TYR A 583 2224 1720 2170 14 13 -41 C 

ATOM 1493 CD2 TYR A 583 24.758 29.358 86.520 1.00 17.99 C 

ANISOU 1493 CD2 TYR A 583 2333 2237 2264 -19 21 83 C 

ATOM 1494 CE1 TYR A 583 26.870 30.952 85.725 1.00 16.44 C 

ANISOU 1494 CE1 TYR A 583 2273 1814 2159 0 -103 65 C 

r* ATOM 1495 CE2 TYR A 583 25.676 29.799 87.454 1.00 19.12 C 

*2 ANISOU 1495 CE2 TYR A 583 2700 2169 2395 -77 -3 48 C 

C s ATOM 1496 CZ TYR A 583 26.715 30.622 87.051 1.00 18.47 C 

SI ANISOU 1496 CZ TYR A 583 2359 2362 2296 -46 19 34 C 

ATOM 1497 OH TYR A 583 27.676 31.089 87.931 1.00 19.08 O 

'%l ANISOU 1497 OH TYR A 583 2802 2068 2378 -85 -99 -10 O 

k' 5 ATOM 1498 N VALA 584 22.856 27.430 81.524 1.00 14.35 N 

f!| ANISOU 1498 N VAL A 584 2297 1448 1706 -6 40 26 N 

jf ATOM 1499 CA VAL A 584 21.917 26.471 80.896 1.00 13.49 C 

ANISOU 1499 CA VAL A 584 1239 1687 2199 -25 -198 -53 C 

ATOM 1500 C VAL A 584 22.739 25.200 80.575 1.00 15.85 C 

e ANISOU 1500 C VAL A 584 2076 1654 2293 -75 84 -156 C 

p ATOM 1501 O VAL A 584 22.297 24,067 80.838 1.00 16.30 O 

^ ANISOU 1501 O VAL A 584 2203 1711 2280 38 89 137 O 

Ul ATOM 1502 CB VAL A 584 21.225 27.070 79.676 1.00 14.64 C 

M ANISOU 1502 CB VAL A 584 2065 1802 1698 148 93 -168 C 

\\ ATOM 1503 CGI VAL A 584 20.504 25.998 78.788 1.00 16.28 C 

~1 ANISOU 1503 CGI VAL A 584 2014 1854 2317 -183 -253 -128 C 

^ ATOM 1504 CG2 VAL A 584 20.116 28.036 80.136 1.00 13.73 C 

M ANISOU 1504 CG2 VAL A 584 1648 1723 1846 126 74 24 C 

ATOM 1505 N CYS A 585 23.890 25.410 79.928 1.00 14.55 N 

ANISOU 1505 N CYS A 585 1829 1839 1861 149 84 -212 N 

ATOM 1506 CA CYS A 585 24.745 24.225 79.586 1.00 14.62 C 

ANISOU 1506 CA CYS A 585 1925 1546 2085 63 68 -103 C 

ATOM 1507 C CYS A 585 25.196 23.483 80.821 1.00 15.99 C 

ANISOU 1507 C CYS A 585 2138 1672 2266 166 93 24 C 

ATOM 1508 O CYS A 585 25.133 22.222 80.860 1.00 15.86 O 

ANISOU 1508 O CYS A 585 2047 1605 2374 -109 216 -105 O 

ATOM 1509 CB CYS A 585 25.953 24.692 78.743 1.00 16.25 C 

ANISOU 1509 CB CYS A 585 2385 1769 2019 -125 137 117 C 

ATOM 1510 SG CYS A 585 25.410 25.282 77.100 1.00 15.94 S 

ANISOU 1510 SG CYS A 585 2463 1609 1983 -35 232 134 S 

ATOM 1511 N MET A 586 25.607 24.149 81.898 1.00 16.65 N 

ANISOU 1511 N MET A 586 2185 1896 2246 -11 -3 75 N 

ATOM 1512 CA MET A 586 26.014 23.444 83.132 1.00 16.64 C 

ANISOU 1512 CA MET A 586 2076 1907 2338 20 -31 109 C 

ATOM 1513 C MET A 586 24.812 22.720 83.714 1.00 16.83 C 

ANISOU 1513 C MET A 586 2199 1798 2397 59 87 79 C 

ATOM 1514 O MET A 586 24.916 21.580 84.199 1.00 16.82 O 

ANISOU 1514 O MET A 586 2377 1660 2353 80 111 80 O 
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ATOM 1515 CB MET A 586 26.665 24.349 84.176 1.00 17.32 C 

ANISOU 1515 CB MET A 586 2143 2171 2266 -74 -39 84 C 

ATOM 1516 CG MET A 586 27.984 24.955 83.586 1.00 17.11 C 

ANISOU 1516 CG MET A 586 2322 2028 2150 -153 160 -64 C 

ATOM 1517 SD MET A 586 28.929 25.753 84.903 1.00 17.40 S 

ANISOU 1517 SD MET A 586 2281 1987 2344 -108 -37 71 S 

ATOM 1518 CE MET A 586 27.885 27.197 85,107 1.00 18.54 C 

ANISOU 1518 CE MET A 586 2749 1903 2393 33 -59 113 C 

ATOM 1519 N ASN A 587 23.624 23.345 83.747 1.00 14.87 N 

ANISOU 1519 N ASN A 587 1994 1530 2125 -56 81 -73 N 

ATOM 1520 CA ASN A 587 22.424 22.671 84,251 1.00 16.26 C 

ANISOU 1520 CA ASN A 587 1976 1864 2337 61 169 49 C 

ATOM 1521 C ASN A 587 22.050 21.459 83.429 1.00 16.93 C 

ANISOU 1521 C ASN A 587 2086 1969 2380 -52 -21 53 C 

ATOM 1522 O ASN A 587 21.545 20.458 84.021 1.00 19.17 O 

ANISOU 1522 O ASN A 587 2601 1963 2720 -43 137 198 O 

ATOM 1523 CB ASN A 587 21.243 23.661 84.322 1,00 16.58 C 

ANISOU 1523 CB ASN A 587 2216 1901 2184 196 62 55 C 

ATOM 1524 CG ASN A 587 21.249 24.420 85.628 1.00 17.93 C 

ANISOU 1524 CG ASN A 587 2440 2145 2227 135 83 81 C 

ATOM 1525 OD1 ASN A 587 21.423 23.843 86.692 1.00 18.16 O 

ANISOU 1525 OD1 ASN A 587 2542 2048 2309 197 181 210 O 

ATOM 1526 ND2 ASN A 587 20.952 25.724 85.554 1.00 17.76 N 

ANISOU 1526 ND2 ASN A 587 2369 2019 2361 122 345 2 N 

ATOM 1527 N THR A 588 22.316 21.472 82.133 1.00 17.13 N 

Q ANISOU 1527 N THR A 588 2480 1643 2386 9 28 9 N 

*Q ATOM 1528 CA THR A 588 22.011 20.287 81.285 1.00 16.51 C 

m ANISOU 1528 CA THR A 588 2153 1685 2436 -35 35 30 C 

ATOM 1529 C THR A 588 23.054 19.219 81.598 1.00 17.11 C 

ANISOU 1529 C THR A 588 2223 1914 2365 -6 -188 54 C 

E(! ATOM 1530 O THR A 588 22.697 18.035 81.711 1.00 18.81 O 

m ANISOU 1530 O THR A 588 2513 1878 2755 -142 75 40 O 

5 2 ATOM 1531 CB THR A 588 22.026 20.673 79.818 1.00 17.47 C 

4* ANISOU 1531 CB THR A 588 2187 2155 2297 172 -2 -123 C 

S| ATOM 1532 OG1 THR A 588 21.090 21.795 79.642 1.00 16.25 O 

AN I SOU 1532 OG1 THR A 588 2279 1995 1900 185 19 146 O 

1 % ATOM 1533 CG2 THR A 588 21.626 19.570 78.851 1.00 18.62 C 

W ANISOU 1533 CG2 THR A 588 2485 2105 2483 206 -10 -123 C 

HI ATOM 1534 N LEU A 589 24.324 19.627 81.659 1.00 17.02 N 

CI ANISOU 1534 N LEU A 589 2103 1940 2424 138 30 66 N 

ATOM 1535 CA LEU A 589 25.396 18.656 81.965 1.00 16.26 C 

AN I SOU 1535 CA LEU A 589 2115 1733 2330 29 -46 29 C 

p ATOM 1536 C LEU A 589 25.164 17.998 83.308 1.00 17.93 C 

y, ANISOU 1536 C LEU A 589 2438 2010 2363 16 -91 52 C 

ATOM 1537 O LEU A 589 25.126 16.737 83.381 1.00 19.00 O 

ANISOU 1537 O LEU A 589 2729 1916 2573 93 -51 47 O 

ATOM 1538 CB LEU A 589 26.732 19.403 81.963 1.00 17.41 C 

ANISOU 1538 CB LEU A 589 2107 2050 2457 36 -46 3 C 

ATOM 1539 CG LEU A 589 27.962 18.670 82.523 1.00 17.90 C 

ANISOU 1539 CG LEU A 589 2019 2230 2553 108 64 -41 C 

ATOM 1540 CD1 LEU A 589 28.251 17.540 81.558 1.00 20.93 C 

ANISOU 1540 CD1 LEU A 589 2940 2349 2665 92 -74 -147 C 

ATOM 1541 CD2 LEU A 589 29.115 19.669 82.572 1.00 19.00 C 

ANISOU 1541 CD2 LEU A 589 2119 2499 2600 -83 24 4 C 

ATOM 1542 N TYR A 590 24.943 18.746 84.356 1.00 17.57 N 

ANISOU 1542 N TYR A 590 2164 2120 2393 -8 -66 -6 N 

ATOM 1543 CA TYR A 590 24.757 18.186 85.696 1.00 19.35 C 

ANISOU 1543 CA TYR A 590 2457 2321 2573 -163 88 69 C 

ATOM 1544 C TYR A 590 23.427 17.447 85.756 1.00 19.84 C 

ANISOU 1544 C TYR A 590 2584 2248 2707 -36 -78 68 C 

ATOM 1545 O TYR A 590 23.429 16.355 86.324 1.00 19.99 O 

ANISOU 1545 O TYR A 590 2457 2270 2868 76 -19 254 O 

ATOM 1546 CB TYR A 590 24.808 19.275 86.791 1.00 18.47 C 

ANISOU 1546 CB TYR A 590 2249 2297 2472 -75 74 29 C 
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ATOM 1579 C ALA A 595 24.059 16.537 90.580 1.00 25.46 C 

ANISOU 1579 C ALA A 595 3060 3203 3410 -98 17 165 C 

ATOM 1580 0 ALA A 595 23.307 17.328 89.966 1.00 27.24 0 

ANISOU 1580 0 ALA A 595 3680 3042 3627 -125 -35 377 0 

ATOM 1581 CB ALA A 595 25.823 15.778 88.991 1.00 23.95 C 

ANISOU 1581 CB ALA A 595 2852 2904 3344 -40 -101 74 C 

ATOM 1582 N PRO A 596 24.375 16.744 91.844 1.00 26.95 N 

ANISOU 1582 N PRO A 596 3555 3310 3376 -124 30 231 N 

ATOM 1583 CA PRO A 596 23.931 17.860 92.645 1.00 27.99 C 

ANISOU 1583 CA PRO A 596 3785 3410 3438 -94 23 227 C 

ATOM 1584 C PRO A 596 24.365 19.165 91.958 1.00 27.48 C 

ANISOU 1584 C PRO A 596 3681 3320 3440 -101 -44 185 C 

ATOM 1585 O PRO A 596 25.433 19.130 91.368 1.00 26.15 O 

ANISOU 1585 O PRO A 596 3607 3060 3269 -96 -133 283 O 

ATOM 1586 CB PRO A 596 24.703 17.704 93.953 1.00 29.33 C 

ANISOU 1586 CB PRO A 596 3911 3724 3508 11 -42 93 C 

ATOM 1587 CG PRO A 596 24.951 16.225 94.025 1.00 29.64 C 

ANISOU 1587 CG PRO A 596 4003 3733 3526 -84 35 123 C 

ATOM 1588 CD PRO A 596 25.295 15.831 92.601 1.00 28.42 C 

ANISOU 1588 CD PRO A 596 3716 3566 3515 14 -69 96 C 

ATOM 1589 N LEU A 597 23.524 20.174 91.973 1.00 27.94 N 

ANISOU 1589 N LEU A 597 3929 3250 3436 -51 33 213 N 

ATOM 1590 CA LEU A 597 23.906 21.420 91.243 1.00 27.38 C 

ANISOU 1590 CA LEU A 597 3727 3383 3295 -79 53 277 C 

ATOM 1591 C LEU A 597 24.791 22,293 92.113 1.00 28.84 C 

Ej ANISOU 1591 C LEU A 597 3977 3446 3536 -157 79 201 C 

,f% ATOM 1592 O LEU A 597 24.268 23.207 92.758 1.00 29.55 O 

2 ANISOU 1592 O LEU A 597 4210 3436 3584 -201 165 175 O 

W ATOM 1593 CB LEU A 597 22.596 22.126 90.814 1.00 25.56 C 

U1 ANISOU 1593 CB LEU A 597 3600 3108 3003 -102 223 184 C 

ATOM 1594 CG LEU A 597 21.613 21.275 90.017 1.00 25.30 C 

~l ANISOU 1594 CG LEU A 597 3413 3191 3009 42 160 174 C 

§y ATOM 1595 CD1 LEU A 597 20.386 22.064 89.594 1.00 23.99 C 

=P ANISOU 1595 CD1 LEU A 597 3367 2805 2944 -20 230 185 C 

! \I ATOM 1596 CD2 LEU A 597 22.201 20.601 88.769 1.00 24.88 C 

ANISOU 1596 CD2 LEU A 597 3302 2962 3190 169 113 120 C 

s ATOM 1597 N ASP A 598 26.090 21.974 92.163 1.00 29.04 N 

O ANISOU 1597 N ASP A 598 3977 3497 3560 -191 14 369 N 

Iff ATOM 1598 CA ASP A 598 26.990 22.694 93.045 1.00 31.19 C 

ANISOU 1598 CA ASP A 598 3988 4002 3862 -191 -48 194 C 

^ ATOM 1599 C ASP A 598 28.111 23.413 92.290 1.00 29.95 C 

SJ ANISOU 1599 C ASP A 598 3848 3913 3620 -147 -65 214 C 

f* ATOM 1600 O ASP A 598 29.121 23.819 92.878 1.00 31.27 O 

ANISOU 1600 O ASP A 598 4012 4046 3825 -244 -108 309 O 

ATOM 1601 CB ASP A 598 27.572 21.752 94.100 1.00 32.87 C 

ANISOU 1601 CB ASP A 598 4345 4096 4047 -52 -63 262 C 

ATOM 1602 CG ASP A 598 28.265 20.527 93.570 1.00 35.53 C 

ANISOU 1602 CG ASP A 598 4633 4443 4423 47 15 30 C 

ATOM 1603 OD1 ASP A 598 28.549 20.335 92.374 1.00 34.43 O 

ANISOU 1603 OD1 ASP A 598 4502 4297 4285 19 -53 249 O 

ATOM 1604 OD2 ASP A 598 28.529 19.643 94.444 1.00 37.50 O 

ANISOU 1604 OD2 ASP A 598 4992 4669 4588 135 -79 138 O 

ATOM 1605 N PHE A 599 27.917 23.551 90.992 1.00 26.85 N 

ANISOU 1605 N PHE A 599 3225 3441 3536 -271 -83 251 N 

ATOM 1606 CA PHE A 599 28.898 24.313 90.190 1.00 24.83 C 

ANISOU 1606 CA PHE A 599 3098 3226 3109 -128 -149 148 C 

ATOM 1607 C PHE A 599 28.790 25.772 90.612 1.00 25.16 C 

ANISOU 1607 C PHE A 599 3151 3265 3146 -82 -21 159 C 

ATOM 1608 O PHE A 599 27.858 26.255 91.265 1.00 25.44 O 

ANISOU 1608 O PHE A 599 3136 3311 3220 -258 27 124 O 

ATOM 1609 CB PHE A 599 28.581 24.102 88.707 1.00 23.88 C 

ANISOU 1609 CB PHE A 599 3038 2995 3041 -53 54 167 C 

ATOM 1610 CG PHE A 599 27.157 24.337 88.304 1.00 23.04 C 

ANISOU 1610 CG PHE A 599 3002 2703 3048 -134 37 107 C 
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ATOM 1611 CD1 PHE A 599 26.700 25.624 88.119 1.00 22.28 C 

ANISOU 1611 CD1 PHE A 599 2986 2648 2832 -161 6 225 C 

ATOM 1612 CD2 PHE A 599 26.248 23.294 88.072 1.00 21.80 C 

ANISOU 1612 CD2 PHE A 599 2787 2692 2805 -102 -18 102 C 

ATOM 1613 CE1 PHE A 599 25.409 25.882 87.742 1.00 22.53 C 

ANISOU 1613 CE1 PHE A 599 2902 2495 3162 -98 48 89 C 

ATOM 1614 CE2 PHE A 599 24.959 23.554 87.677 1.00 21.61 C 

ANISOU 1614 CE2 PHE A 599 2819 2598 2793 17 167 58 C 

ATOM 1615 CZ PHE A 599 24.500 24.850 87.490 1.00 22.64 C 

ANISOU 1615 CZ PHE A 599 3010 2566 3024 -119 -25 95 C 

ATOM 1616 N ASP A 600 29.798 26.540 90.164 1.00 24.19 N 

ANISOU 1616 N ASP A 600 3133 3051 3009 -160 11 107 N 

ATOM 1617 CA ASP A 600 29.844 27.974 90.466 1.00 25.05 C 

ANISOU 1617 CA ASP A 600 3165 3116 3237 -82 89 28 C 

ATOM 1618 C ASP A 600 30.354 28.804 89.316 1.00 23.68 C 

ANISOU 1618 C ASP A 600 3167 2819 3013 3 -47 -52 C 

ATOM 1619 0 ASP A 600 30.542 28.313 88.214 1.00 21.71 0 

ANISOU 1619 0 ASP A 600 2866 2354 3030 -112 188 21 0 

ATOM 1620 CB ASP A 600 30.720 28.231 91.701 1.00 26.96 C 

ANISOU 1620 CB ASP A 600 3507 3477 3260 -144 -28 64 C 

ATOM 1621 CG ASP A 600 32.153 27.782 91.458 1.00 29.38 C 

ANISOU 1621 CG ASP A 600 3562 3804 3796 -31 -28 -17 C 

ATOM 1622 OD1 ASP A 600 32.679 27.806 90.322 1.00 27.07 O 

ANISOU 1622 OD1 ASP A 600 3206 3273 3808 -335 -116 151 O 

ATOM 1623 OD2 ASP A 600 32.835 27.340 92.425 1.00 33.08 O 

Q ANISOU 1623 OD2 ASP A 600 4285 4283 4001 -63 -211 224 O 

ifj ATOM 1624 N TYR A 601 30.623 30.104 89.581 1.00 23.00 N 

i£ ANISOU 1624 N TYR A 601 2881 2784 3074 -181 6 -13 N 

^ ATOM 1625 CA TYR A 601 30.998 30.987 88.477 1.00 23.93 C 

III ANISOU 1625 CA TYR A 601 3108 2869 3115 -23 112 4 C 

m ATOM 1626 C TYR A 601 32.373 30.687 87.901 1.00 22.53 C 

Z:l ANISOU 1626 C TYR A 601 2963 2527 3070 -121 -1 -65 C 

K# ATOM 1627 O TYR A 601 32.586 31.070 86.749 1.00 22.09 O 

JS ANISOU 1627 O TYR A 601 2989 2358 3045 -232 -155 -181 O 

ATOM 1628 CB TYR A 601 30.915 32.464 88.854 1.00 25.47 C 

ANISOU 1628 CB TYR A 601 3374 2902 3402 58 169 -11 C 

1 ATOM 1629 CG TYR A 601 32.106 32.960 89.642 1.00 27.42 C 

0 ANISOU 1629 CG TYR A 601 3456 3295 3669 53 74 -101 C 

Iff ATOM 1630 CD1 TYR A 601 33.002 33.759 88.923 1.00 28.68 C 

?; • ANISOU 1630 CD1 TYR A 601 3624 3501 3772 -29 125 -38 C 

ATOM 1631 CD2 TYR A 601 32.356 32.704 90.971 1.00 27.34 C 

N ANISOU 1631 CD2 TYR A 601 3461 3216 3713 -24 49 -77 C 

p ATOM 1632 CE1 TYR A 601 34.127 34.265 89.545 1.00 31.22 C 

ijl ANISOU 1632 CE1 TYR A 601 3922 3822 4117 -95 -46 -82 C 

' * ATOM 1633 CE2 TYR A 601 33.486 33.223 91.622 1.00 30.06 C 

ANISOU 1633 CE2 TYR A 601 3700 3660 4061 -59 -82 -168 C 

ATOM 1634 CZ TYR A 601 34.345 34.003 90.870 1.00 31.10 C 

ANISOU 1634 CZ TYR A 601 3948 3810 4059 -102 -12 -80 C 

ATOM 1635 OH TYR A 601 35.489 34.534 91.430 1.00 34.40 O 

ANISOU 1635 OH TYR A 601 4128 4249 4695 -335 -50 -160 O 

ATOM 1636 N LYS A 602 33.235 30.034 88.683 1.00 22.59 N 

ANISOU 1636 N LYS A 602 2780 2694 3110 -50 15 -34 N 

ATOM 1637 CA LYS A 602 34.536 29.637 88.115 1.00 23.43 C 

ANISOU 1637 CA LYS A 602 2910 2822 3171 -66 -34 -87 C 

ATOM 1638 C LYS A 602 34.320 28.485 87.131 1.00 21.11 C 

ANISOU 1638 C LYS A 602 2551 2625 2844 -32 -93 124 C 

ATOM 1639 O LYS A 602 34.974 28.404 86.096 1.00 20.00 O 

ANISOU 1639 O LYS A 602 2793 2215 2593 -110 -270 161 O 

ATOM 1640 CB LYS A 602 35.501 29.259 89.238 1.00 25.08 C 

ANISOU 1640 CB LYS A 602 3216 3205 3108 57 -82 42 C 

ATOM 1641 CG LYS A 602 35.763 30.422 90.178 1.00 28.29 C 

ANISOU 1641 CG LYS A 602 3679 3448 3622 -80 -6 -182 C 

ATOM 1642 CD LYS A 602 36.742 30.059 91.294 1.00 31.15 C 

ANISOU 1642 CD LYS A 602 3996 3959 3880 6 -124 103 C 
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ATOM 1643 CE LYS A 602 36.820 31.260 92.239 1.00 34.67 C 

ANISOU 1643 CE LYS A 602 4624 4288 4261 -68 -69 -124 C 

ATOM 1644 NZ LYS A 602 37.947 31.103 93.200 1.00 36.24 N 

ANISOU 1644 NZ LYS A 602 4693 4587 4488 -2 -158 -7 N 

ATOM 1645 N ASP A 603 33.376 27.576 87.467 1.00 21.49 N 

ANISOU 1645 N ASP A 603 2627 2580 2958 -40 -213 32 N 

ATOM 1646 CA ASP A 603 33.041 26.540 86.512 1.00 20.70 C 

ANISOU 1646 CA ASP A 603 2503 2580 2783 -64 -94 64 C 

ATOM 1647 C ASP A 603 32.507 27.199 85.242 1.00 19.84 C 

ANISOU 1647 C ASP A 603 2374 2362 2802 -9 -39 58 C 

ATOM 1648 O ASP A 603 32.747 26.694 84.168 1.00 20.25 O 

ANISOU 1648 0 ASP A 603 2571 2168 2955 22 -34 10 O 

ATOM 1649 CB ASP A 603 31.939 25.602 87.073 1.00 21.75 C 

ANISOU 1649 CB ASP A 603 2753 2565 2946 -117 12 58 C 

ATOM 1650 CG ASP A 603 32.473 24.814 88.259 1.00 24.95 C 

ANISOU 1650 CG ASP A 603 3244 3090 3146 21 -102 103 C 

ATOM 1651 OD1 ASP A 603 33.504 24.124 88.067 1.00 26.87 O 

ANISOU 1651 OD1 ASP A 603 3238 3253 3718 83 -119 288 O 

ATOM 1652 OD2 ASP A 603 31.931 24.832 89.375 1.00 27.25 O 

ANISOU 1652 OD2 ASP A 603 3427 3575 3351 -29 37 71 O 

ATOM 1653 N ALA A 604 31.718 28.290 85.353 1.00 19.54 N 

ANISOU 1653 N ALA A 604 2625 2177 2620 7 -79 4 N 

ATOM 1654 CA ALA A 604 31.201 28.929 84.157 1.00 19.18 C 

ANISOU 1654 CA ALA A 604 2317 2340 2631 244 25 23 C 

ATOM 1655 C ALA A 604 32.294 29.466 83.253 1.00 18.27 C 

p ANISOU 1655 C ALA A 604 2402 2110 2431 113 -28 ~5 C 

.„& ATOM 1656 O ALA A 604 32.274 29.280 82.024 1.00 17.93 O 

ANISOU 1656 O ALA A 604 2357 2005 2452 9 -136 10 O 

ATOM 1657 CB ALA A 604 30.227 30.061 84.549 1.00 18.43 C 

if! ANISOU 1657 CB ALA A 604 2495 2034 2473 172 217 -85 C 

m* ATOM 1658 N ILE A 605 33.357 30.040 83.881 1.00 18.09 N 

2" ANISOU 1658 N ILE A 605 2206 2018 2649 84 -200 101 N 

ATOM 1659 CA ILE A 605 34.524 30.487 83.090 1.00 20.05 C 

II ANISOU 1659 CA ILE A 605 2547 2479 2593 -41 -65 99 C 

%.\ ATOM 1660 C ILE A 605 35.181 29.333 82.349 1.00 20.12 C 

ANISOU 1660 C ILE A 605 2441 2666 2537 -33 -44 42 C 

* ATOM 1661 O ILE A 605 35.406 29.342 81.123 1.00 19.13 O 

0 ANISOU 1661 O ILE A 605 2040 2732 2496 -99 -53 308 O 

,[|f ATOM 1662 CB ILE A 605 35.566 31.179 84.004 1.00 22.12 C 

7] ANISOU 1662 CB ILE A 605 2722 2902 2781 -53 -161 -18 C 

ATOM 1663 CGI ILE A 605 34.967 32.459 84.578 1.00 23.89 C 

SJ ANISOU 1663 CGI ILE A 605 3070 2951 3055 7 -153 -50 C 

rl ATOM 1664 CG2 ILE A 605 36.839 31.425 83.206 1.00 22.71 C 

H ANISOU 1664 CG2 ILE A 605 2780 2808 3039 -145 -60 34 C 

^* ATOM 1665 CD1 ILE A 605 35.732 32.980 85.815 1.00 25.66 C 

ANISOU 1665 CD1 ILE A 605 3420 3243 3087 -53 -191 -125 C 

ATOM 1666 N ARG A 606 35.368 28.196 83.040 1.00 20.12 N 

ANISOU 1666 N ARG A 606 2448 2560 2637 -79 -51 69 N 

ATOM 1667 CA ARG A 606 35.949 27.000 82.408 1.00 20.36 C 

ANISOU 1667 CA ARG A 606 2577 2493 2667 9 -50 121 C 

ATOM 1668 C ARG A 606 35.010 26.402 81.374 1.00 19.06 C 

ANISOU 1668 C ARG A 606 2646 2122 2474 113 -19 254 C 

ATOM 1669 O ARG A 606 35.447 25.822 80.366 1.00 19.14 O 

ANISOU 1669 O ARG A 606 2476 2306 2493 98 -42 176 O 

ATOM 1670 CB ARG A 606 36.295 25.912 83.473 1.00 22.72 C 

ANISOU 1670 CB ARG A 606 2998 2540 3096 272 -55 189 C 

ATOM 1671 CG ARG A 606 31.308 26.410 84.479 1.00 26.58 C 

ANISOU 1671 CG ARG A 606 3415 3256 3430 66 -151 11 C 

ATOM 1672 CD ARG A 606 38.035 25.267 85.256 1.00 31.19 C 

ANISOU 1672 CD ARG A 606 3774 3958 4120 416 -263 239 C 

ATOM 1673 NE ARG A 606 38.633 25.996 86.406 1.00 36.26 N 

ANISOU 1673 NE ARG A 606 4700 4591 4486 44 -222 -63 N 

ATOM 1674 CZ ARG A 606 38.021 26.191 87.579 1.00 38.29 C 

ANISOU 1674 CZ ARG A 606 4854 4946 4750 102 8 -24 C 
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ATOM 1675 NH1 ARG A 606 36.817 25.729 87.869 1.00 37.81 N 

ANISOU 1675 NH1 ARG A 606 4845 4766 4756 88 -69 72 N 

ATOM 1676 NH2 ARG A 606 38.655 26.909 88.513 1.00 40.36 N 

ANISOU 1676 NH2 ARG A 606 5115 5185 5034 24 -103 -129 N 

ATOM 1677 N MET A 607 33.698 26.515 SI. 636 1.00 18.65 N 

ANISOU 1677 N MET A 607 2407 2017 2662 -87 -177 122 N 

ATOM 1678 CA MET A 607 32.727 25.913 80,700 1.00 17.45 C 

ANISOU 1678 CA MET A 607 2118 2046 2467 66 -29 -52 C 

ATOM 1679 C MET A 607 32.860 26.493 79.316 1.00 17.15 C 

ANISOU 1679 C MET A 607 2198 1824 2492 20 -17 -29 C 

ATOM 1680 O MET A 607 32.678 25.805 78,280 1.00 17.55 O 

ANISOU 1680 O MET A 607 2300 1799 2569 195 -146 14 O 

ATOM 1681 CB MET A 607 31.296 26.054 81.261 1.00 17.74 C 

ANISOU 1681 CB MET A 607 2194 2041 2506 17 -12 -50 C 

ATOM 1682 CG MET A 607 30.294 25.283 80.411 1.00 18.24 C 

ANISOU 1682 CG MET A 607 2330 1868 2732 14 -22 -86 C 

ATOM 1683 SD MET A 607 30.433 23.522 80.870 1.00 21.00 S 

ANISOU 1683 SD MET A 607 2754 1772 3455 20 274 11 S 

ATOM 1684 CE MET A 607 28.922 22.888 80.130 1.00 23.17 C 

ANISOU 1684 CE MET A 607 3229 2264 3312 -110 -4 -70 C 

ATOM 1685 N ARG A 608 33.029 27.838 79.234 1.00 16.78 N 

ANISOU 1685 N ARG A 608 2217 1769 2391 -179 -19 49 N 

ATOM 1686 CA ARG A 608 33.181 28.429 77.908 1.00 17.63 C 

ANISOU 1686 CA ARG A 608 2363 1974 2360 30 30 -43 C 

^ ATOM 1687 C ARG A 608 34.321 27.799 77.119 1.00 17.62 C 

U ANISOU 1687 C ARG A 608 2356 1929 2408 -79 -55 61 C 

*Q ATOM 1688 O ARG A 608 34.190 27.502 75.925 1.00 17.32 O 

:54 ANISOU 1688 O ARG A 608 2299 1917 2366 252 3 66 O 

^ ATOM 1689 CB ARG A 608 33.449 29.942 77.991 1.00 17.88 C 

y) ANISOU 1689 CB ARG A 608 2112 2023 2658 -163 54 -138 C 

fl! ATOM 1690 CG ARG A 608 32.229 30.739 78.460 1.00 18.98 C 

S| ANISOU 1690 CG ARG A 608 2551 1958 2703 128 -14 11 C 

*2 ATOM 1691 CD ARG A 608 32.666 32.253 78.576 1.00 17.55 C 

# ANISOU 1691 CD ARG A 608 2249 1946 2474 -71 193 -106 C 

S| ATOM 1692 NE ARG A 608 31.394 32.915 78.980 1.00 17.14 N 

a ANISOU 1692 NE ARG A 608 2186 1846 2482 97 -144 -33 N 

ATOM 1693 CZ ARG A 608 30.504 33.396 78.114 1.00 15.88 C 

W ANISOU 1693 CZ ARG A 608 2051 1640 2343 97 109 27 C 

HI ATOM 1694 NHl ARG A 608 30.772 33.443 76.823 1.00 15.44 N 

y, ANISOU 1694 NHl ARG A 608 2086 1450 2331 -66 77 64 N 

I B ATOM 1695 NH2 ARG A 608 . 29.340 33.848 78.621 1.00 16.41 N 

ANISOU 1695 NH2 ARG A 608 1942 1619 2673 98 179 -61 N 

p ATOM 1696 N ARG A 609 35.453 27.620 77.789 1.00 18.06 N 

ANISOU 1696 N ARG A 609 2279 2080 2503 -8 18 -48 N 

ATOM 1697 CA ARG A 609 36.611 27.001 77.142 1.00 17.41 C 

ANISOU 1697 CA ARG A 609 2022 2049 2543 13 56 1 C 

ATOM 1698 C ARG A 609 36.359 25.524 76.856 1.00 17.72 C 

ANISOU 1698 C ARG A 609 2243 2089 2400 -52 49 16 C 

ATOM 1699 O ARG A 609 36.785 25.068 75.798 1.00 17.83 O 

ANISOU 1699 O ARG A 609 2305 1896 2575 -72 73 -206 O 

ATOM 1700 CB ARG A 609 37.832 27.155 78.056 1.00 18.95 C 

ANISOU 1700 CB ARG A 609 2357 2325 2517 51 -52 31 C 

ATOM 1701 CG ARG A 609 38.083 28.575 78.532 1.00 21.70 C 

ANISOU 1701 CG ARG A 609 2856 2580 2809 -112 68 -75 C 

ATOM 1702 CD ARG A 609 38.122 29.614 77.417 1.00 21.87 C 

ANISOU 1702 CD ARG A 609 2675 2615 3020 106 13 73 C 

ATOM 1703 NE ARG A 609 39.401 29.723 76.724 1.00 24.82 N 

ANISOU 1703 NE ARG A 609 2951 3140 3338 -90 75 -19 N 

ATOM 1704 CZ ARG A 609 39.663 30.813 75.980 1.00 25.69 C 

ANISOU 1704 CZ ARG A 609 3153 3218 3391 19 52 56 C 

ATOM 1705 NHl ARG A 609 40.843 30.889 75.381 1.00 26.94 N 

ANISOU 1705 NHl ARG A 609 3289 3403 3543 -132 194 -78 N 

ATOM 1706 NH2 ARG A 609 38.761 31.798 75.850 1.00 25.09 N 

ANISOU 1706 NH2 ARG A 609 3025 3077 3430 -96 172 64 N 
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ATOM 1707 N PHE A 610 35.615 24.839 77.721 1.00 17.97 N 

ANISOU 1707 N PHE A 610 2169 2177 2481 -42 71 116 N 

ATOM 1708 CA PHE A 610 35.353 23.406 77.523 1.00 17.48 C 

ANISOU 1708 CA PHE A 610 2024 2123 2495 162 75 21 C 

ATOM 1709 C PHE A 610 34.494 23.220 76.298 1.00 17.42 C 

ANISOU 1709 C PHE A 610 2114 2179 2325 66 201 -2 C 

ATOM 1710 O PHE A 610 34.816 22.420 75.417 1.00 18.23 O 

ANISOU 1710 O PHE A 610 2474 2083 2370 229 142 65 O 

ATOM 1711 CB PHE A 610 34.683 22.863 78.783 1.00 18.11 C 

ANISOU 1711 CB PHE A 610 2222 2089 2570 217 122 94 C 

ATOM 1712 CG PHE A 610 34.227 21.417 78.689 1.00 18.67 C 

ANISOU 1712 CG PHE A 610 2564 2220 2310 -39 55 219 C 

ATOM 1713 CD1 PHE A 610 35.118 20.454 78.281 1.00 20.71 C 

ANISOU 1713 CD1 PHE A 610 2821 2309 2737 98 24 157 C 

ATOM 1714 CD2 PHE A 610 32.915 21.063 79.003 1.00 21.16 C 

ANISOU 1714 CD2 PHE A 610 2602 2602 2834 -148 56 104 C 

ATOM 1715 CE1 PHE A 610 34.719 19.116 78.219 1.00 21.69 C 

ANISOU 1715 CE1 PHE A 610 2966 2382 2894 32 -56 122 C 

ATOM 1716 CE2 PHE A 610 32.547 19.718 78.977 1.00 22.54 C 

ANISOU 1716 CE2 PHE A 610 3038 2531 2997 -67 238 29 C 

ATOM 1717 CZ PHE A 610 33.426 18.739 78.552 1.00 21.82 C 

ANISOU 1717 CZ PHE A 610 2836 2503 2951 73 43 209 C 

ATOM 1718 N ILE A 611 33.380 23.998 76.222 1.00 16.24 N 

ANISOU 1718 N ILE A 611 2012 1977 2182 127 16 228 N 

ATOM 1719 CA ILE A 611 32.527 23.864 75.035 1.00 16.24 C 

PJ ANISOU 1719 CA ILE A 611 1939 1952 2280 201 12 -37 C 

ATOM 1720 C ILE A 611 33.310 24.212 73.785 1.00 16.58 C 

^ ANISOU 1720 C ILE A 611 2108 1858 2332 115 30 -64 C 

&) ATOM 1721 O ILE A 611 33.189 23.530 72.756 1.00 17.64 O 

IH ANISOU 1721 O ILE A 611 2382 2036 2285 181 -4 -69 O 

^ ATOM 1722 CB ILE A 611 31.255 24.731 75.132 1.00 15.45 C 

J : ANISOU 1722 CB ILE A 611 1902 1920 2050 285 104 -116 C 

III ATOM 1723 CGI ILE A 611 30.431 24.331 76.354 1.00 16.00 C 

jg ANISOU 1723 CGI ILE A 611 1929 2044 2105 247 56 -62 C 

*i ATOM 1724 CG2 ILE A 611 30.424 24.655 73.846 1.00 16.68 C 

N ANISOU 1724 CG2 ILE A 611 2073 2138 2127 41 70 -19 C 

* ATOM 1725 CD1 ILE A 611 29.280 25.311 76.605 1.00 16.86 C 

O ANISOU 1725 CD1 ILE A 611 2262 2219 1925 401 -30 -205 C 

sll ATOM 1726 N ALA A 612 34.115 25.268 73.772 1.00 18.62 N 

¥ l ANISOU 1726 N ALA A 612 2225 2017 2834 152 170 -39 N 

H ATOM 1727 CA ALA A 612 34.911 25.624 72.624 1.00 18.38 C 

\J ANISOU 1727 CA ALA A 612 2398 2081 2504 55 68 -93 C 

j=6 ATOM 1728 C ALA A 612 35.832 24.451 72.239 1.00 19.28 C 

ANISOU 1728 C ALA A 612 2371 2340 2613 185 70 -14 C 

N ATOM 1729 O ALA A 612 35.929 24.157 71.030 1.00 19.78 O 

ANISOU 1729 O ALA A 612 2372 2552 2594 143 312 52 O 

ATOM 1730 CB ALA A 612 35.736 26.885 72.841 1.00 19.07 C 

ANISOU 1730 CB ALA A 612 2617 2230 2400 -100 30 -45 C 

ATOM 1731 N HIS A 613 36.474 23.821 73.221 1.00 19.98 N 

ANISOU 1731 N HIS A 613 2309 2476 2806 124 9 62 N 

ATOM 1732 CA HIS A 613 37.329 22.651 72.937 1.00 20.47 C 

ANISOU 1732 CA HIS A 613 v 2369 2469 2941 18 80 -62 C 

ATOM 1733 C HIS A 613 36.547 21.510 72.290 1.00 21.13 C 

ANISOU 1733 C HIS A 613 2664 2544 2821 -1 -62 -23 C 

ATOM 1734 O HIS A 613 37.060 20.814 71.381 1.00 21.74 O 

ANISOU 1734 O HIS A 613 2330 2938 2991 14 83 17 O 

ATOM 1735 CB HIS A 613 37.903 22.187 74.295 1.00 22.31 C 

ANISOU 1735 CB HIS A 613 2631 2841 3003 47 14 -116 C 

ATOM 1736 CG HIS A 613 38.624 20.871 74.229 1.00 24.90 C 

ANISOU 1736 CG HIS A 613 3020 3051 3390 203 -67 -22 C 

ATOM 1737 ND1 HIS A 613 39.879 20.744 73.679 1.00 27.12 N 

ANISOU 1737 ND1 HIS A 613 3270 3437 3598 175 104 -32 N 

ATOM 1738 CD2 HIS A 613 38.217 19.650 74.631 1.00 26.89 C 

ANISOU 1738 CD2 HIS A 613 3397 3212 3608 156 -67 101 C 
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ATOM 1739 CE1 HIS A 613 40.233 19.464 73.770 1.00 27.29 C 

ANISOU 1739 CE1 HIS A 613 3385 3415 3569 128 -5 -39 C 

ATOM 1740 NE2 HIS A 613 39.255 18.777 74.316 1.00 28.03 N 

ANISOU 1740 NE2 HIS A 613 3602 3425 3622 209 134 33 N 

ATOM 1741 N LEU A 614 35:327 21.253 72.788 1.00 19.66 N 

ANISOU 1741 N LEU A 614 2392 2269 2811 80 -163 19 N 

ATOM 1742 CA LEU A 614 34.491 20.168 72,213 1.00 20.01 C 

ANISOU 1742 CA LEU A 614 2508 2301 2793 35 128 -122 C 

ATOM 1743 C LEU A 614 34.231 20.439 70.750 1.00 19.65 C 

ANISOU 1743 C LEU A 614 2353 2291 2821 107 89 -76 C 

ATOM 1744 O LEU A 614 34.239 19.536 69.893 1.00 22,24 O 

ANISOU 1744 O LEU A 614 2787 2573 3090 197 110 -310 O 

ATOM 1745 CB LEU A 614 33.169 19.989 72.985 1.00 19.74 C 

ANISOU 1745 CB LEU A 614 2497 2272 2730 44 63 -50 C 

ATOM 1746 CG LEU A 614 33.308 19.558 74.459 1.00 19.67 C 

ANISOU 1746 CG LEU A 614 2396 2295 2784 215 83 -35 C 

ATOM 1747 CD1 LEU A 614 31.949 19.510 75.146 1.00 19.67 C 

ANISOU 1747 CD1 LEU A 614 2708 2145 2621 -57 207 10 C 

ATOM 1748 CD2 LEU A 614 33.944 18.152 74.506 1.00 20.94 C 

ANISOU 1748 CD2 LEU A 614 3022 2012 2923 15 120 40 C 

ATOM 1749 N ILE A 615 33.965 21.698 70.391 1.00 18.12 N 

ANISOU 1749 N ILE A 615 2168 2250 2465 -32 144 31 N 

ATOM 1750 CA ILE A 615 33.720 22.059 68.995 1.00 18.12 C 

ANISOU 1750 CA ILE A 615 2398 2077 2409 99 85 -72 C 

ATOM 1751 C ILE A 615 34.967 21.831 68.146 1.00 20.48 C 

ANISOU 1751 C ILE A 615 2572 2621 2588 -51 178 -32 C 

^ ATOM 1752 O ILE A 615 34.860 21.203 67.080 1.00 22.69 O 

@ ANISOU 1752 O ILE A 615 2927 2694 3001 -15 307 -278 O 

S3 ATOM 1753 CB ILE A 615 33.263 23.546 68.925 1.00 18.30 C 

?Jt ANISOU 1753 CB ILE A 615 2496 2017 2441 25 86 -122 C 

ATOM 1754 CGI ILE A 615 31.880 23.653 69.547 1.00 17.52 C 

ANISOU 1754 CGI ILE A 615 2422 1987 2248 -41 134 -141 C 

Vd ATOM 1755 CG2 ILE A 615 33.228 23.975 67.481 1.00 20.24 C 

r s ANISOU 1755 CG2 ILE A 615 2754 2517 2418 88 45 -112 C 

y*, ATOM 1756 CD1 ILE A 615 31.356 25.064 69.831 1.00 18.55 C 

^ ANISOU 1756 CD1 ILE A 615 2472 2053 2523 56 20 -102 C 

* ATOM 1757 N LEU A 616 36.102 22.327 68.598 1.00 21.45 N 

f) ANISOU 1757 N LEU A 616 2581 2872 2696 -49 166 -217 N 

^ ATOM 1758 CA LEU A 616 37.342 22.218 67.795 1.00 22.75 C 

~ 5 ANISOU 1758 CA LEU A 616 2616 3016 3014 95 242 -179 C 

M* ATOM 1759 C LEU A 616 37.803 20.779 67.676 1.00 24.30 C 

hi ANISOU 1759 C LEU A 616 3075 3016 3144 87 131 -173 C 

ATOM 1760 O LEU A 616 38.525 20.439 66.709 1.00 27.12 O 

^ ANISOU 1760 O LEU A 616 3319 3620 3367 101 421 -246 O 

M= ATOM 1761 CB LEU A 616 38.463 23.090 68.401 1.00 24.20 C 

ANISOU 1761 CB LEU A 616 2926 3134 3137 24 79 -87 C 

ATOM 1762 CG LEU A 616 38.143 24.601 68.424 1.00 23.96 C 

ANISOU 1762 CG LEU A 616 2932 3094 3077 -67 43 -97 C 

ATOM 1763 CD1 LEU A 616 39.340 25.345 69.003 1.00 24.71 C 

ANISOU 1763 CD1 LEU A 616 2827 3304 3257 -99 143 -95 C 

ATOM 1764 CD2 LEU A 616 37.687 25.127 67.070 1.00 24.18 C 

ANISOU 1764 CD2 LEU A 616 2823 3353 3010 -112 183 -93 C 

ATOM 1765 N THR A 617 37.465 19.938 68.624 1.00 24.70 N 

ANISOU 1765 N THR A 617 3019 3206 3160 143 74 -76 N 

ATOM 1766 CA THR A 617 37.874 18.535 68.575 1.00 26.86 C 

ANISOU 1766 CA THR A 617 3444 3234 3529 140 6 -81 C 

ATOM 1767 C THR A 617 36.800 17.638 67.990 1.00 27.41 C 

ANISOU 1767 C THR A 617 3561 3318 3537 100 -42 -81 C 

ATOM 1768 O THR A 617 36.922 16.401 68.061 1.00 27.99 O 

ANISOU 1768 O THR A 617 3683 3287 3663 205 22 49 O 

ATOM 1769 CB THR A 617 38.363 18.063 69.958 1.00 27.91 C 

ANISOU 1769 CB THR A 617 3614 3416 3574 44 0 33 C 

ATOM 1770 OG1 THR A 617 37.296 18.211 70.880 1.00 27.25 O 

ANISOU 1770 OG1 THR A 617 3519 3134 3702 273 -46 -40 O 
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ATOM 1771 CG2 THR A 617 39.576 18.891 70.379 1.00 28.06 C 

ANISOU 1771 CG2 THR A 617 3685 3282 3695 105 -26 -73 C 

ATOM 1772 N ASP A 618 35.777 18.223 67.367 1.00 26.05 N 

ANISOU 1772 N ASP A 618 3386 3188 3323 184 145 -162 N 

ATOM 1773 CA ASP A 618 34.752 17.468 66.657 1.00 26.29 C 

ANISOU 1773 CA ASP A 618 3300 3247 3443 103 141 -90 C 

ATOM 1774 C ASP A 618 34.217 16.378 67.564 1.00 26.42 C 

ANISOU 1774 C ASP A 618 3335 3310 3394 -1 36 -48 C 

ATOM 1775 O ASP A 618 34.106 15.184 67.231 1.00 28.47 O 

ANISOU 1775 O ASP A 618 3766 3429 3621 97 265 -221 O 

ATOM 1776 CB ASP A 618 35.387 16.865 65.402 1.00 27.42 C 

ANISOU 1776 CB ASP A 618 3593 3470 3355 18 124 -86 C 

ATOM 1777 CG ASP A 618 34.473 16.394 64.307 1.00 29.38 C 

ANISOU 1777 CG ASP A 618 3695 3827 3643 64 20 -170 C 

ATOM 1778 OD1 ASP A 618 33.272 16.690 64.259 1.00 28.30 O 

ANISOU 1778 OD1 ASP A 618 3575 3617 3561 46 30 -177 O 

ATOM 1779 OD2 ASP A 618 35.013 15.712 63.385 1.00 30.41 O 

ANISOU 1779 OD2 ASP A 618 3908 3839 3806 114 83 -324 O 

ATOM 1780 N ALA A 619 33.736 16.776 68.731 1.00 26.15 N 

ANISOU 1780 N ALA A 619 3280 3141 3515 -6 105 -129 N 

ATOM 1781 CA ALA A 619 33.275 15.864 69.770 1.00 27.62 C 

ANISOU 1781 CA ALA A 619 3633 3383 3477 81 45 -32 C 

ATOM 1782 C ALA A 619 32,068 15.032 69.419 1.00 27.85 C 

ANISOU 1782 C ALA A 619 3762 3295 3526 73 63 -134 C 

^ ATOM 1783 O ALA A 619 31.819 14.056 70.136 1.00 28.62 O 

P ANISOU 1783 O ALA A 619 4120 3361 3392 -16 121 -227 O 

yp ATOM 1784 CB ALA A 619 33.022 16.646 71.066 1.00 29.09 C 

*?f ANISOU 1784 CB ALA A 619 4019 3553 3483 52 91 -75 C 

TJs ATOM 1785 N LEU A 620 31.326 15.357 68.371 1.00 26.99 N 

ANISOU 1785 N LEU A 620 3593 3232 3429 44 102 -257 N 

01 ATOM 1786 CA LEU A 620 30.151 14.556 68.024 1.00 27.73 C 

m ANISOU 1786 CA LEU A 620 3608 3365 3564 -6 36 -209 C 

S 2 ATOM 1787 C LEU A 620 30.529 13.416 67.070 1.00 30.37 C 

4* ANISOU 1787 C LEU A 620 4115 3680 3744 20 96 -361 C 

%J ATOM 1788 O LEU A 620 29.596 12.684 66.711 1.00 31.58 O 

ANISOU 1788 O LEU A 620 4099 3835 4064 -36 218 -362 O 

^ ATOM 1789 CB LEU A 620 29.050 15.406 67.434 1.00 26.26 C 

M ANISOU 1789 CB LEU A 620 3465 3153 3361 -103 93 -317 C 

Hi ATOM 1790 CG LEU A 620 28.679 16.652 68.271 1.00 25.84 C 

ANISOU 1790 CG LEU A 620 3365 3163 3290 44 50 -245 C 

f' s ATOM 1791 CD1 LEU A 620 27.702 17.486 67.465 1.00 25.51 C 

ANISOU 1791 CD1 LEU A 620 3310 3188 3196 -106 7 -167 C 

Q ATOM 1792 CD2 LEU A 620 28.148 16.243 69.630 1.00 26.24 C 

y, ANISOU 1792 CD2 LEU A 620 3379 3280 3310 -85 -37 -155 C 

ATOM 1793 N LYS A 621 31.796 13.208 66.768 1.00 32.72 N 

ANISOU 1793 N LYS A 621 4175 3990 4266 50 99 -65 N 

ATOM 1794 CA LYS A 621 32.184 12.071 65.922 1.00 36.62 C 

ANISOU 1794 CA LYS A 621 4758 4481 4676 149 201 -311 C 

ATOM 1795 C LYS A 621 31.137 11.666 64.901 1.00 38.99 C 

ANISOU 1795 C LYS A 621 4958 4897 4960 54 19 -146 C 

ATOM 1796 O LYS A 621 31.105 10.428 64.598 1.00 41.65 O 

ANISOU 1796 O LYS A 621 5441 4896 5488 39 90 -189 O 

ATOM ■ 1797 CB LYS A 621 32.540 10.823 66.748 1.00 39.07 C 

ANISOU 1797 CB LYS A 621 5043 4693 5111 159 37 -98 C 

ATOM 1798 CG LYS A 621 31.583 10.196 67.684 1.00 42.00 C 

ANISOU 1798 CG LYS A 621 5286 5345 5328 30 128 -25 C 

ATOM 1799 CD LYS A 621 31.797 8.768 68.147 1.00 44.44 C 

ANISOU 1799 CD LYS A 621 5734 5463 5688 60 47 36 C 

ATOM 1800 CE LYS A 621 30.905 7.777 67.425 1.00 45.57 C 

ANISOU 1800 CE LYS A 621 5760 5741 5813 -48 20 10 C 

ATOM 1801 NZ LYS A 621 31.375 6.374 67.661 1.00 46.21 N 

ANISOU 1801 NZ LYS A 621 5891 5768 5900 -2 -48 -29 N 

TER 1802 LYS A 621 

ATOM 1803 N PRO B 20 5.110 13.180 40.537 1.00 61.48 N 
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